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CRYSTALS AND STRUCTURE OF LuxS 



1. CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of priority of provisional application serial no. 
60/237,933, filed October 3, 2000, the contents of which are hereby incorporated by reference. 

2. FIELD OF THE INVENTION 

The present invention concerns crystalline forms of polypeptides that correspond to LuxS, 
methods of obtaining such crystals and the high-resolution X-ray diffraction structures and 
atomic structure coordinates obtained therefrom. The crystals of the invention and the atomic 
structural information obtained therefrom are useful for solving the crystal and solution 
structures of related and unrelated LuxSs, and for screening for, identifying and/or designing 
compounds that bind and/or modulate a biological activity of LuxS. The atomic structural 
information may also be used to design novel mutant forms of LuxS polypeptides. 

3. BACKGROUND OF THE INVENTION 

LuxS protein is involved in the production of autoinducer-2 (AI-2), an intercellular 
signaling molecule employed in the quorum sensing pathway of various bacteria (WO 
00/32152). These bacteria include pathogens such as Helicobacter pylori, Haemophilus 
influenzae, Campylorbacter jejuni, Salmonella typhimurium, Vibrio cholerae and Nesseria 
meningitidis. 

Intercellular communication between bacteria regulates several key bacterial functions, 
such as competence and sporulation in Bacillus, sporulation and motility in Myxococcus and 
quorum sensing in bacteria such as luminous Vibrio. Each communication system uses a 
different small extracellular signaling molecule, all of which are amino acid based. For instance, 
such signaling molecules include acyl-homoserine lactones (HSL), peptides, and a mixture of 
amino acids and fragments of peptidoglycan, respectively. V harveyi uses two independent cell- 
cell communication systems in controlling its luminescence expression. Signaling system 1 is 
highly species specific and uses a homoserine signal. System 2 is not as well characterized, but 
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appears to not be species-specific. It has been proposed that it employs 4,5-dihydroxy-2,3- 
pentanedione as its signal (PCT WO 00/32152). 

Quorum sensing bacteria synthesize, release and respond to specific autoinducers in order 
to control gene expression as a function of cell density. The quorum sensing pathways are 
important in the virulence of some bacteria. For example, Pseudomonas aeruginosa exists as 
biofilms in cystic fibrosis lungs. Quorum sensing via signaling system 1 is employed by P. 
aeruginosa to enable biofilm formation. Enterohemorrhagic E. coli (EHEC) is responsible for 
hemorrhagic colitis and hemolytic uremic syndrome. Enteropathogenic E. coli (EPEC) is 
responsible for infant diarrhea. Virulence in these E. coli bacteria is controlled by expression of 
the type in secretion system of which LuxS is a crucial component. LuxS thus provides a target 
for novel compounds that can be used to treat or prevent virulent infections of microorganisms 
such as EPEC. 

Until the present invention, the ability to obtain the atomic structure coordinates of LuxS 
has not been realized. Crystals of LuxS and the atomic structure coordinates of LuxS would 
enable further study of the LuxS protein. Significantly, the atomic structure coordinates of LuxS 
would enable the design and selection of antibiotics that target bacterial strains whose 
pathogenesis depends on the quorum sensing pathway. 

4. SUMMARY OF THE INVENTION 

In one aspect, the invention provides crystalline forms of polypeptides corresponding to 
LuxS of the LuxS Family type. The LuxS Family include proteins represented by SWISSPROT 
accession numbers Q9ZMW8 , 024931, Q9XDU6, 034667, D75280, Q9Z5X1, P45578, 
P44007, 050164 and other related polypeptides. The crystals of the invention comprise 
crystallized polypeptides corresponding to the wild-type or mutated LuxS. The crystals of the 
invention include native crystals, in which the crystallized LuxS is substantially pure; heavy- 
atom derivative crystals, in which the crystallized LuxS is in association with one or more heavy- 
metal atoms; and co-crystals, in which the crystallized LuxS is in association with one or more 
compounds, including but not limited to, cofactors, ligands, substrates, substrate analogs, 
inhibitors, allosteric effectors, etc. to form a crystalline co-complex. Preferably, such compounds 
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bind a catalytic or active site or a site on the LuxS molecule that modulates a biological activity 
of the LuxS protein. The co-crystals may be native co-crystals, in which the co-complex is 
substantially pure, or they may be heavy-atom derivative co-crystals, in which the co-complex is 
in association with one or more heavy-metal atoms. 
5 The crystals of the invention were obtained for LuxS protein from three bacteria: 

Helicobacter pylori, Haemophilus influenzae, and Deinococcus radiodurans (two forms). They 
are generally characterized by a unit cell of a=71.04 +/-0.2 A, b-71.04 +/-0.2 A, c=130.14+/-0.2 
A, ct=90, fi=90, y=90 (for H. pylori); a=129.59 +/-0.2 A, b=129.59 +/-0.2 A, c=53.74+A0.2 A, a 
=90, B=90, y-90 (fori/, influenzae); a=43.71 +/-0.2 A, b=82.18 +/-0.2 A, C-49.48+/-0.2 A, a 
10 =90, B=90, y-90 (fori), radiodurans); and a=51.19 +/-0.2 A, b=70.14 +/-0.2 A, c=49.73+/-0.2 
A, a =90, 6=1 12.03, y=90 (also for D. radiodurans). The H. pylori protein grew crystals in space 
group P432]2. The H. influenzae protein grew crystals in space group Y^{L{L. The D. 
yj radiodurans protein grew crystals in both P2i and C2 space groups. These crystals are preferably 
fH j of diffraction quality. Representatative diffraction images for the four LuxS crystals are shown 
|1 in FIGS. 2-5. In more preferred embodiments, the crystals of the invention are of sufficient 
Ul quality to permit the determination of the three-dimensional X-ray diffraction structure of the 
.7 crystalline polypeptide to high resolution, preferably to a resolution of greater than about 3 A, 
17, typically in the range of about 1 A to about 3 A or in the range of about 1.8 A to about 2.4 A. 
£3 The invention also provides methods of making the crystals of the invention. Generally, 

?;§ native crystals of the invention are grown by dissolving substantially pure polypeptide in an 
^ aqueous buffer that includes a precipitant at a concentration just below that necessary to 
precipitate the polypeptide. Water is then removed by controlled evaporation to produce 
precipitating conditions, which are maintained until crystal growth ceases. 

Co-crystals of the invention are prepared by soaking a native crystal prepared according 
25 to the above method in a liquor comprising the compound of the desired co-complex. 

Alternatively, the co-crystals may be prepared by co-crystallizing the polypeptide in the presence 
of the compound according to the method discussed above. 

Heavy-atom derivative crystals of the invention may be prepared by soaking native 
crystals or co-crystals prepared according to the above method in a liquor comprising a salt of a 
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heavy atom or an organometallic compound. Alternatively, heavy-atom derivative crystals may 
be prepared by crystallizing a polypeptide comprising selenomethionine and/or selenocysteine 
residues according to the methods described previously for preparing native crystals. 

In another aspect, the invention provides machine and/or computer-readable media 
5 embedded with the three-dimensional structural information obtained from the crystals of the 
invention, or portions or subsets thereof. Such three-dimensional structural information will 
typically include the atomic structure coordinates of the crystallized polypeptide or co-complex, 
or the atomic structure coordinates of a portion thereof such as, for example, the atomic structure 
coordinates of the amino acid residues corresponding to an active or binding site, but may 
10 include other structural information, such as vector representations of the atomic structures 
coordinates, etc. The types of machine- or computer-readable media onto which the structural 
information is embedded typically include magnetic tape, floppy discs, hard disc storage media, 
S optical discs, CD-ROM, electrical storage media such as RAM or ROM, and hybrids of any of 
jl these storage media. Such media also include paper on which is recorded the structural 
m information that can be read by a scanning device and converted into a three-dimensional 

structure with an OCR and further includes stereo diagrams of three-dimensional structures from 
C3 which coordinates can be derived. The machine and/or computer-readable media of the 
U invention may further comprise additional information that is useful for representing the three- 
jj{ dimensional structure of the crystalline polypeptides, including, but not limited to, thermal 
2fr parameters, chain identifiers, and connectivity information. 

CI The invention is illustrated by way of working examples demonstrating the crystallization 

and characterization of crystals, the collection of diffraction data, and the determination and 
analysis of the three-dimensional structures of LuxS protein from these different bacterial 
species: Helicobacter pylori, Haemophilus influenzae and Deinococcus radiodurans. 

2 5 The atomic structure coordinates and machine readable media of the invention have a 

variety of uses. For example, the coordinates are useful for solving the three-dimensional X-ray 
diffraction and/or solution structures of other LuxSs, including mutant LuxS, co-complexes 
comprising LuxS, and unrelated LuxSs, to high resolution. Structural information may also be 
used in a variety of molecular modeling and computer-based screening applications to, for 
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example, intelligently design mutants of the crystallized LuxS that have altered biological 
activity and to computationally design and identify compounds that bind the polypeptide or a 
portion or fragment of the polypeptide, such as the active site. Such compounds may be used as 
lead compounds in pharmaceutical efforts to identify compounds that inhibit LuxS as a 
5 therapeutic approach toward the treatment of, e.g. , infectious disease (LuxS inhibitors may be 
good antibiotics), stomach cancer, stomach ulcers and other intestinal complications 
(Helicobacter pylori is a causative agent for stomach ulcers and stomach cancer). Such 
compounds can also be used to treat or prevent, for example, infectious disease caused by 
pathogens such as Haemophilus influenzae, Campylorbacter jejuni*, Salmonella typhimurium, 
10 Vibrio cholerae and Nesseria meningitidis. 

The examples demonstrate that the crystal structures of LuxS has been determined to 2.4 
A resolution (H. pylori), 2.1 A resolution (H. influenzae), and 2.1 A and 1.8 A resolution (D. 
radiodurans). 



il 5. BRIEF DESCRIPTION OF THE FIGURES AND TABLES 



5.1 Brief Description of the Figures 



FIG. 1 provides a structure-based sequence alignment of LuxS proteins; 



FIG. 2 provides a diffraction pattern of LuxS from H. pylori; 



FIG. 3 provides a diffraction pattern of LuxS from H. influenzae; 



25 



FIG. 4 provides a diffraction pattern of LuxS from D. radiodurans, P2i space group; 



FIG. 5 provides a diffraction pattern of LuxS from D. radiodurans, C2 space group; 
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FIG. 6A provides a ribbon diagram of molecule A of the D. radiodurans LuxS 
structure; 

FIG. 6B provides a ribbon diagram of molecule A of the H. influenzae LuxS structure; 

FIG. 6C provides a ribbon diagram of molecule B of the H. pylori P2i LuxS structure; 

FIG. 6D provides a ribbon diagram of the H. pylori C2 LuxS structure; 

FIG 7 provides a ribbon diagram of the H. pylori LuxS asymmetric unit contents; 

FIG. 8 provides a stereo view of a Ca trace of H. pylori LuxS. 

FIG. 9A provides a ribbon diagram illustrating the sequence-variable region of H. 
pylori LuxS; 

FIG. 9B provides a ribbon diagram illustrating a sequence-conserved region ofH. 
pylori LuxS; 

FIG 10A provides a ribbon diagram illustrating the substrate binding site region of H. 
pylori LuxS as a monomer; 

FIG. 10B provides a ribbon diagram illustrating the active site region qfH. pylori LuxS 
as a dimer; 

FIG. 11 provides a ribbon diagram of the active site region of H. influenzae LuxS 
showing the metal and methionine binding sites; 
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FIG. 12 provides a SPOCK diagram of the molecular surfaces of the two molecules in the 
asymmetric unit of K influenzae LuxS; 

FIG. 13A illustrates the electrostatic potential of the molecular surfaces of the 
dimerization interface ofH. influenzae LuxS; and 

FIG. 13B illustrates the location of conserved residues of the dimerization interface of 
H. influenzae LuxS. 

5.2 Brief Description of the Tables 

Table 1 shows the classifications of commonly encountered amino acids; 

Table 2 summarizes the X-ray crystallography data sets of LuxS crystals that were used 
to determine the structures of crystalline LuxS of the inventions as well as the results of the 
refinements; 

Table 3 presents an index of the diffraction image shown in FIG 2; 
Table 4 presents an index of the diffraction image shown in FIG 3; 
Table 5 presents an index of the diffraction image shown in FIG 4; 
Table 6 presents an index of the diffraction image shown in FIG 5; 
Table 7 summarizes the atomic structure coordinates of H. pylori LuxS; 
Table 8 summarizes the atomic structure coordinates of H. influenzae LuxS; 
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Table 9 summarizes the atomic structure coordinates of D. radiodurans P2i LuxS; and 

Table 10 summarizes the atomic structure coordinates of D. radiodurans C2 LuxS. 
6. DETAILED DESCRIPTION OF THE INVENTION 

6.1 Abbreviations 

The amino acid notations used herein for the twenty genetically encoded L-amino 
acids are conventional and are as follows: 



Amino Acid 


One-Letter 
Symbol 


Three-Letter 
Symbol 


Alanine 


A 


Ala 


Arginine 


R 


Arg 


Asparagine 


N 


Asn 


Aspartic acid 


D 


Asp 


Cysteine 


C 


Cys 


Glutamine 


Q 


Gin 


vjiuianuc acm 




Glu 


Glycine 


G 


Gly 


Histidine 


H 


His 


Isoleucine 


I 


He 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophan 


W 

8 


Trp 



One-Letter Three-Letter 
Amino Acid Symbol Symbol 

Tyrosine Y Tyr 

Valine V Val 



As used herein, unless specifically delineated otherwise, the three-letter amino acid 
abbreviations designate amino acids in the L-configuration. Amino acids in the D-configuration 
are preceded with a "D-." For example, Arg designates L-arginine and D-Arg designates D- 
arginine. Likewise, the capital one-letter abbreviations refer to amino acids in the L- 
configuration. Lower-case one-letter abbreviations designate amino acids in the D-configuration. 
For example, "R" designates L-arginine and "r" designates D-arginine. 

Unless noted otherwise, when polypeptide sequences are presented as a series of one- 
letter and/or three-letter abbreviations, the sequences are presented in the N -> C direction, in 
accordance with common practice. 

6.2 Definitions 

As used herein, the following terms shall have the following meanings: 

" Genetically Encoded Amino Acid " refers to L-isomers of the twenty amino acids that 
are defined by genetic codons. The genetically encoded amino acids are the L-isomers of 
glycine, alanine, valine, leucine, isoleucine, serine, methionine, threonine, phenylalanine, 
tyrosine, tryptophan, cysteine, proline, histidine, aspartic acid, asparagine, glutamic acid, 
glutamine, arginine and lysine. 

" Genetically Non-Encoded Amino Acid " refers to amino acids that are not defined by 
genetic codons. Genetically non-encoded amino acids include derivatives or analogs of the 
genetically-encoded amino acids that are capable of being enzymatically incorporated into 
nascent polypeptides using conventional expression systems, such as selenomethionine (SeMet) 
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and selenocysteine (SeCys); isomers of the genetically-encoded amino acids that are not 
capable of being enzymatically incorporated into nascent polypeptides using conventional 
expression systems, such as D-isomers of the genetically-encoded amino acids; L- and D- 
isomers of naturally occurring a-amino acids that are not defined by genetic codons, such as a 
5 -aminoisobutyric acid (Aib); L- and D-isomers of synthetic a-amino acids that are not defined 
by genetic codons; and other amino acids such as p-amino acids, y-amino acids, etc. In 
addition to the D-isomers of the genetically-encoded amino acids, exemplary common 
genetically non-encoded amino acids include, but are not limited to, nor leucine (Nle), 
penicillamine (Pen), N-methylvaline (MeVal), homocysteine (hCys), homoserine (hSer), 2,3- 

10 diaminobutyric acid (Dab) and ornithine (Orn). Additional exemplary genetically non-encoded 
amino acids are found, for example, in Practical Handbook of Biochemistry and Molecular 

_ Biology, 1989, Fasman, Ed., CRC Press, Inc., Boca Raton, FL, pp. 3-76 and the various 

O references cited therein. 

|j| " Hydrophilic Amino Acid " refers to an amino acid having a side chain exhibiting a 

^ hydrophobicity of less than zero according to the normalized consensus hydrophobicity scale of 

Eisenberg etal, 1984, J. Mol. Biol. 179:125-142. Exemplary genetically encoded 
P| hydrophilic amino acids include Thr (T), Ser (S), His (H), Glu (E), Asn (N), Gin (Q), Asp 
y (D), Lys (K) and Arg (R). Exemplary genetically non-encoded hydrophilic amino acids 
SI include the D-isomers of the above-listed genetically-encoded amino acids, ornithine (Orn), 

2,3-diaminobutyric acid (Dab) and homoserine (hSer). 

" Acidic Amino Acid " refers to a hydrophilic amino acid having a side chain pK value 
of less than 7 under physiological conditions. Acidic amino acids typically have negatively 
25 charged side chains at physiological pH due to loss of a hydrogen ion. Exemplary genetically 
encoded acidic amino acids include Glu (E) and Asp (D). Exemplary genetically non-encoded 
acidic amino acids include D-Glu (e) and D-Asp (d). 
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" Basic Amino Acid " refers to a hydrophilic amino acid having a side chain pK value of 
greater than 7 under physiological conditions. Basic amino acids typically have positively 
charged side chains at physiological pH due to association with hydronium ion. Exemplary 
genetically encoded basic amino acids include His (H), Arg (R) and Lys (K). Exemplary 
5 genetically non-encoded basic amino acids include the D-isomers of the above-listed 
genetically-encoded amino acids, ornithine (Orn) and 2,3-diaminobutyric acid (Dab). 

" Polar Amino Acid " refers to a hydrophilic amino acid having a side chain that is 
uncharged at physiological pH, but which comprises at least one covalent bond in which the 

10 pair of electrons shared in common by two atoms is held more closely by one of the atoms. 
Exemplary genetically encoded polar amino acids include Asn (N), Gin (Q), Ser (S), and Thr 

p (T). Exemplary genetically non-encoded polar amino acids include the D-isomers of the above- 

C* listed genetically-encoded amino acids and homoserine (hSer). 

SB " Hydrophobic Amino Acid " refers to an amino acid having a side chain exhibiting a 

I § hydrophobicity of greater than zero according to the normalized consensus hydrophobicity 
f s scale of Eisenberg et al, 1984, J. MoL Biol. 179:125-142. Exemplary genetically encoded 
[y hydrophobic amino acids include Pro (P), He (I), Phe (F), Val (V), Leu (L), Trp (W), Met 
j;; (M), Ala (A), Gly (G) and Tyr (Y). Exemplary genetically non-encoded hydrophobic amino 
So acids include the D-isomers of the above-listed genetically-encoded amino acids, norleucine 
(Nle) and N-methyl valine (MeVal). 

" Aromatic Amino Acid " refers to a hydrophobic amino acid having a side chain 
comprising at least one aromatic or heteroaromatic ring. The aromatic or heteroaromatic ring 
25 may include one or more substituents such as -OH, -SH, -CN, -F, -CI, -Br, -I, -NO2, -NO, 

-NH2, -NHR, -NRR, -C(0)R, -C(0)OH, -C(0)OR, -C(0)NH 2 , -C(0)NHR, -C(0)NRR and the 
like where each R is independently (Ci-Ce) alkyl, (C2-C6) alkenyl, or (C2-C6) alkynyl. 
Exemplary genetically encoded aromatic amino acids include Phe (F), Tyr (Y), Trp (W) and 
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His (H). Exemplary genetically non-encoded aromatic amino acids include the D-isomers of 
the above-listed genetically-encoded amino acids. 

" Apolar Amino Acid " refers to a hydrophobic amino acid having a side chain that is 
uncharged at physiological pH and which has bonds in which the pair of electrons shared in 
common by two atoms is generally held equally by each of the two. Exemplary genetically 
encoded apolar amino acids include Leu (L), Val (V), He (I), Met (M), Gly (G) and Ala (A). 
Exemplary genetically non-encoded apolar amino acids include the D-isomers of the above- 
listed genetically-encoded amino acids, norleucine (Nle) and N-methyl valine (Me Val). 

" Aliphatic Amino Acid " refers to a hydrophobic amino acid having an unsubstituted 
aliphatic hydrocarbon side chain. Exemplary genetically encoded aliphatic amino acids include 
Ala (A), Val (V), Leu (L) and He (I). Exemplary genetically non-encoded aliphatic amino acids 
include the D-isomers of the above-listed genetically-encoded amino acids, norleucine (Nle) 
and N-methyl valine (MeVal). 

" Helix-Breaking Amino Acid " refers to those amino acids that have a propensity to 
disrupt the structure of ot-helices when contained at internal positions within the helix. Amino 
acid residues exhibiting helix-breaking properties are well-known in the art {see, e.g., Chou & 
Fasman, 1978, Ann. Rev. Biochem. 47:251-276) and include, for example, Pro (P), D-Pro 
(p), Gly (G) and potentially all D-amino acids (when contained in an L-polypeptide; 
conversely, L-amino acids disrupt helical structure when contained in a D-polypeptide). 

" Cysteine-like Amino Acid " refers to an amino acid having a side chain capable of 
participating in a disulfide linkage. Thus, cysteine-like amino acids generally have a side chain 
containing at least one thiol (-SH) group. Cysteine-like amino acids are unusual in that they 
can form disulfide bridges with other cysteine-like amino acids. The ability of Cys (C) 
residues and other cysteine-like amino acids to exist in a polypeptide in either the reduced free 
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-SH or oxidized disulfide-bridged (-S-S-) form affects whether they contribute net hydrophobic 
or hydrophilic character to a polypeptide. Thus, while Cys (C) exhibits a hydrophobicity of 
0.29 according to the consensus scale of Eisenberg (Eisenberg, 1984, supra), it is to be 
understood that for purposes of the present invention Cys (C) is categorized as a polar 

5 hydrophilic amino acid, notwithstanding the general classifications defined above. Other 
cysteine-like amino acids are similarly categorized as polar hydrophilic amino acids. Typical 
cysteine-like residues include, for example, penicillamine (Pen), homocysteine (hCys), etc. 

As will be appreciated by those of skill in the art, the above-defined classes or 
categories are not mutually exclusive. Thus, amino acids having side chains exhibiting two or 

10 more physico-chemical properties can be included in multiple categories. For example, amino 
acid side chains having aromatic groups that are further substituted with polar substituents, 
such as His (H), may exhibit both aromatic hydrophobic properties and polar or hydrophilic 

Hi3 properties, and could therefore be included in both the aromatic and polar categories. 

ril Typically, amino acids will be categorized in the class or classes that most closely define their 

1| net physico-chemical properties. The appropriate categorization of any amino acid will be 

^ apparent to those of skill in the art. 

a The classifications of the genetically encoded and common non-encoded amino acids 

hi according to the categories defined above are summarized in Table 1, below. It is to be 
l % understood that Table 1 is for illustrative purposes only and does not purport to be an 
SI) exhaustive list of the amino acid residues belonging to each class. Other amino acid residues 

not specifically mentioned herein can be readily categorized based on their observed physical 

and chemical properties in light of the definitions provided herein. 
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TABLE 1 

CLASSIFICATIONS OF COMMONLY ENCOUNTERED AMINO ACIDS 



1 CI C C 1 "Fl CI "t 1 fill 

\^ictd ail n<& nun 


OpTipflrfillv 

VJ cut IX V tuij 


frenetically 




Encoded 


Non-Encoded 


Hydrophobic 






Aromatic 


F, Y, W, H 


f, y, w, h 


Apolar 


L, V, I, M, G, A, P 


1, v, i, m, a, p, Nle, MeVal 


Aliphatic 


A, V, L, I 


a, v, 1, i, Nle, MeVal 


Hydrophilic 






Acidic 


D, E 


d, e 


Basic 


H, K, R 


h, k, r, Orn, Dab 


Polar 


C, Q, N, S, T 


c, q, n, s, t, hSer 


Helix-Breaking 


P,G 


P 



5 y 

!# 

Q " LuxS polypeptide " or "LuxS" refers to a polypeptide comprising an amino acid 

sequence that corresponds to a wild-type LuxS polypeptide or a mutant LuxS polypeptide, as 
J;* defined below. 

is; st: 1 

I| " Wild-type LuxS polypeptide " or " wtLuxS " refers to a polypeptide comprising an 

amino acid sequence that corresponds identically to the amino acid sequence of a naturally- 
occurring LuxS. 

" Helicobacter pylori wtLuxS " refers to a polypeptide comprising an amino acid 
is sequence that corresponds identically to the wild-type LuxS from Helicobacter pylori (FIG. 1, 
SEQ ID NO:l). 
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" Haemophilus influenzae wtLuxS " refers to a polypeptide comprising an amino acid 
sequence that corresponds identically to the wild-type LuxS from Haemophilus influenzae 
(FIG. 1, SEQ ID NO:2). 



s " Deinococcus radiodurans wtLuxS " refers to a polypeptide comprising an amino acid 

sequence that corresponds identically to the wild-type LuxS from Deinococcus radiodurans 
(FIG. 1, SEQ ID NO:3). 

" Crystallized Helicobacter pylori LuxS " refers to a polypeptide comprising an amino 
10 acid sequence which corresponds identically to SEQ ID NO: 1, or a mutant thereof, and which 
is in crystalline form. 

l§ " Crystallized Haemophilus influenzae LuxS " refers to a polypeptide comprising an 

? jj amino acid sequence which corresponds identically to SEQ ID NO: 2, or a mutant thereof, and 
i€f which is in crystalline form. 



7 " Crystallized Deinococcus radiodurans LuxS " refers to a polypeptide comprising an 

I* amino acid sequence which corresponds identically to SEQ ID NO: 3, or a mutant thereof, and 
O which is in crystalline form. 

u " Association "refers to a condition of proximity between a chemical entity or 

compound, or portions or fragments thereof, and a polypeptide, or portions or fragments 
thereof. The association may be non-covalent, i.e. , where the juxtaposition is energetically 
favored by, e.g., hydrogen-bonding, van der Waals, electrostatic or hydrophobic interactions, 
25 or it may be covalent. 



" Co-Complex " refers to a LuxS polypeptide in association with one or more 
compounds. Such compounds include, by way of example and not limitation, cofactors, 
ligands, substrates, substrate analogues, inhibitors, allosteric effectors, etc. 
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" Mutant LuxS polypeptide " or " mLuxS " or " Mutant " refers to a polypeptide 
characterized by an amino acid sequence that differs from the wild-type sequence by the 
substitution of at least one amino acid residue of the wild-type sequence with a different amino 
acid residue and/or by the addition and/or deletion of one or more amino acid residues to or 
from the wild-type sequence. The additions and/or deletions can be from an internal region of 
the wild-type sequence and/or at either or both of the N- or C-termini. A mutant may have, 
but need not have, LuxS activity. Preferably, a mutant displays biological activity that is 
substantially similar to that of the wild-type LuxS. 

" Conservative Mutant " refers to a mutant in which at least one amino acid residue from 
the wild-type sequence is substituted with a different amino acid residue that has similar 
physical and chemical properties, i.e. , an amino acid residue that is a member of the same 
class or category, as defined above. For example, a conservative mutant may be a polypeptide 
that differs in amino acid sequence from the wild-type sequence by the substitution of a 
specific aromatic Phe (F) residue with an aromatic Tyr (Y) or Trp (W) residue. 

" Non-Conservative Mutant " refers to a mutant in which at least one amino acid residue 
from the wild-type sequence is substituted with a different amino acid residue that has 
dissimilar physical and/or chemical properties, i.e. , an amino acid residue that is a member of 
a different class or category, as defined above. For example, a non-conservative mutant may 
be a polypeptide that differs in amino acid sequence from the wild-type sequence by the 
substituion of an acidic Glu (E) residue with a basic Arg (R), Lys (K) or Orn residue. 

" Deletion Mutant " refers to a mutant having an amino acid sequence that differs from 
the wild-type sequence by the deletion of one or more amino acid residues from the wild-type 
sequence. The residues may be deleted from internal regions of the wild-type sequence and/or 
from one or both termini. 



261295-1 



16 



" Truncated Mutant " refers to a deletion mutant in which the deleted residues are from 
the N- and/or C-terminus of the wild-type sequence. 

" Extended Mutant " refers to a mutant in which additional residues are added to the N- 
5 and/or C-terminus of the wild-type sequence. 

" Methionine mutant " refers to (1) a mutant in which at least one methionine residue of 
the wild-type sequence is replaced with another residue, preferably with an aliphatic residue, 
most preferably with a Leu (L) or He (I) residue; or (2) a mutant in which a non-methionine 
10 residue, preferably an aliphatic residue, most preferably a Leu (L) or He (I) residue, of the 
wild-type sequence is replaced with a methoinine residue. 

5 " Selenomethionine mutant " refers to (1) a mutant which includes at least one 

r ?t selenomethionine (SeMet) residue, typically by substitution of a Met residue of the wild-type 

s is? 

|g sequence with a SeMet residue, or by addition of one or more SeMet residues at one or both 
LJ termini, or (2) a methionine mutant in which at least one Met residue is substituted with a 
g W SeMet residue. Preferred SeMet mutants are those in which each Met residue is substituted 
with a SeMet residue. 



!;!> " Cysteine mutant " refers to (1) a mutant in which at least one cysteine residue of the 

u wild-type sequence is replaced with another residue, preferably with a Ser (S) residue; or (2) a 

mutant in which a non-cysteine residue, preferably a Ser (S) residue, of the wild-type sequence 

is replaced with a cysteine residue. 

25 " Selenocysteine mutant " refers to (1) a mutant which includes at least one 

selenocysteine (SeCys) residue, typically by substitution of a Cys residue of the wild-type 
sequence with a SeCys residue, or by addition of one or more SeCys residues at one or both 
termini, or (2) a cysteine mutant in which at least one Cys residue is substituted with a SeCys 
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residue. Preferred SeCys mutants are those in which each Cys residue is substituted with a 
SeCys residue. 

" Homologue " refers to a polypeptide having at least 70%, 80%, 90%, 95% or 99% 
5 amino acid sequence identity or having a BLAST score of 1 x 10" 6 over at least 100 contiguous 
amino acids (Altschul et al., 1997, Nucleic Acids Res. 25:3389-402) with a wtLuxS. 

" Crystal " refers to a composition comprising a polypeptide in crystalline form. The 
term "crystal" includes native crystals, heavy-atom derivative crystals and co-crystals, as 
io defined herein. 

_ " Native Crystal " refers to a crystal wherein the polypeptide is substantially pure. As 

*l3 used herein, native crystals do not include crystals of polypeptides comprising amino acids that 
f Li are modified with heavy atoms, such as crystals of selenomethionine mutants, selenocysteine 
B§ mutants, etc. 

=s " Heavy-atom Derivative Crystal " refers to a crystal wherein the polypeptide is in 

fin association with one or more heavy-metal atoms. As used herein, heavy-atom derivative 
y crystals include native crystals into which a heavy metal atom is soaked, as well as crystals of 
§5 selenomethionine mutants and selenocysteine mutants. 

" Co-Crystal " refers to a composition comprising a co-complex, as defined above, in 
crystalline form. Co-crystals include native co-crystals and heavy-atom derivative 
co-crystals. 

25 

" Diffraction Quality Crystal " refers to a crystal that is well-ordered and of a sufficient 
size, i.e., at least 10 jam, preferably at least 50 |im, and most preferably at least 100 \im in its 
smallest dimension such that it produces measurable diffraction to at least 3 A resolution, 
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preferably to at least 2.4 or 1.8 A resolution, and most preferably to at least 1.5 A resolution 
or greater resolution. Diffraction quality crystals include native crystals, heavy-atom derivative 
crystals, and co-crystals. 

5 " Unit Cell " refers to the smallest and simplest volume element {i.e. , parallelpiped- 

shaped block) of a crystal that is completely representative of the unit or pattern of the crystal, 
such that the entire crystal can be generated by translation of the unit cell. The dimensions of 
the unit cell are defined by six numbers: dimensions a, b and c and angles a, p and y (Blundel 
et a/., 1976, Protein Crystallography, Academic Press.). A crystal is an efficiently packed 

10 array of many unit cells. 

" Triclinic Unit Cell " refers to a unit cell in which a=£b=£c and a 4=- p + y. 

fyi " Monoclinic Unit Cell " refers to a unit cell in which a^b=£c; a = y =90°; and p is 

11 defined to be > 90°. 

LI 

yJ 

* " Orthorhombic Unit Cell " refers to a unit cell in which a*b*c; and a = p=y=90°. 

*;S " Tetragonal Unit Cell " refers to a unit cell in which a=b*c; and cc=p=y=90°. 

" Trigonal/Rhombohedral Unit Cell " refers to a unit cell in which a=b=c; and a=p=y 

*90°. 

" Trigonal/Hexagonal Unit Cell " refers to a unit cell in which a=b=c; ot=p=90°; and 
25 y=120°. 

" Cubic Unit Cell " refers to a unit cell in which a=b=c; and a=p=y=90°. 
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" Crystal Lattice " refers to the array of points defined by the vertices of packed unit 

cells. 



" Space Group " refers to the set of symmetry operations of a unit cell. In a space group 
designation (e.g. , C2), the capital letter indicates the lattice type and the other symbols 
represent symmetry operations that can be carried out on the unit cell without changing its 
appearance. 

" Asymmetric Unit " refers to the largest aggregate of molecules in the unit cell that 
possesses no symmetry elements that are part of the space group symmetry, but that can be 
juxtaposed on other identical entities by symmetry operations. 

" Cry stallographically-Related Dimer " refers to a dimer of two molecules wherein the 
symmetry axes or planes that relate the two molecules comprising the dimer coincide with the 
symmetry axes or planes of the crystal lattice. 

" Non-Cry stallographically-Related Dimer " refers to a dimer of two molecules wherein 
the symmetry axes or planes that relate the two molecules comprising the dimer do not 
coincide with the symmetry axes or planes of the crystal lattice. 

" Isomorphous Replacement " refers to the method of using heavy-atom derivative 
crystals to obtain the phase information necessary to elucidate the three-dimensional structure 
of a crystallized polypeptide (Blundel et aL, 1976, Protein Crystallography, Academic Press.). 

" Multi- Wavelength Anomalous Dispersion or MAD " refers to a crystallographic 
technique in which X-ray diffraction data are collected at several different wavelengths from a 
single heavy-atom derivative crystal, wherein the heavy atom has absorption edges near the 
energy of incoming X-ray radiation. The resonance between X-rays and electron orbitals leads 
to differences in X-ray scattering from absorption of the X-rays (known as anomalous 
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scattering) and permits the locations of the heavy atoms to be identified, which in turn provides 
phase information for a crystal of a polypeptide. A detailed discussion of MAD analysis can 
be found in Hendrickson, 1985, Trans. Am. Crystallogr. Assoc., 21:11; Hendrickson et ah, 
1990, EMBO J. 9:1665; and Hendrickson, 1991, Science 4:91. 

5 

" Single Wavelength Anomalous Dispersion or SAP " refers to a crystallographic 
technique in which X-ray diffraction data are collected at a single wavelength from a single 
native or heavy-atom derivative crystal, and phase information is extracted using anomalous 
scattering information from atoms such as sulfur or chlorine in the native crystal or from the 

10 heavy atoms in the heavy-atom derivative crystal. The wavelength of X-rays used to collect 
data for this phasing technique need not be close to the absorption edge of the anomalous 
scatterer. A detailed discussion of SAD analysis can be found in Brodersen et al., 2000, Acta 

J5 Cryst., D56:431-441. 

if " Single Isomorphous Replacement With Anomalous Scattering or SIRAS " refers to a 

L J crystallographic technique that combines isomorphous replacement and anomalous scattering 
* w techniques to provide phase information for a crystal of a polypeptide. X-ray diffraction data 

are collected at a single wavelength, usually from a single heavy-atom derivative crystal. 
O Phase information obtained only from the location of the heavy atoms in a single heavy-atom 
2^ derivative crystal leads to an ambiguity in the phase angle, which is resolved using anomalous 
u scattering from the heavy atoms. Phase information is therefore extracted from both the 

location of the heavy atoms and from anomalous scattering of the heavy atoms. A detailed 

discussion of SIRAS analysis can be found in North, 1965, Acta Cryst. 18:212-216; Matthews, 

1966, Acta Cryst. 20:82-86. 

25 

" Molecular Replacement " refers to the method of calculating initial phases for a new 
crystal of a polypeptide whose structure coordinates are unknown by orienting and positioning 
a polypeptide whose structure coordinates are known within the unit cell of the new crystal so 
as to best account for the observed diffraction pattern of the new crystal. Phases are then 
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calculated from the oriented and positioned polypeptide and combined with observed 
amplitudes to provide an approximate Fourier synthesis of the structure of the polypeptides 
comprising the new crystal. (Lattman, 1985, Methods in Enzymology 115:55-77; Rossmann, 
1972, "The Molecular Replacement Method," Int. Sci. Rev. Ser. No. 13, Gordon & Breach, 
New York.). 

" Having substantially the same three-dimensional structure " refers to a polypeptide that 
is characterized by a set of atomic structure coordinates that have a root mean square deviation 
(r.m.s.d.) of less than or equal to about 2 A when superimposed onto the atomic structure 
coordinates of Table 7, Table 8, Table 9, or Table 10 when at least about 50% to 100% of 
contiguous or non-contiguous Cot atoms of the coordinates are included in the superposition. 

" Cq ; " As used herein, "Ca" refers to the alpha carbon of an amino acid residue. 

6.3 Crystalline LuxS 

The crystals from which the atomic structure coordinates of the invention may be 
obtained include native crystals and heavy-atom derivative crystals. Native crystals generally 
comprise substantially pure polypeptides corresponding to LuxS in crystalline form. 

It is to be understood that the crystalline LuxS from which the atomic structure 
coordinates of the invention can be obtained is not limited to wild-type LuxS. Indeed, the 
crystals may comprise mutants of wild-type LuxS. Mutants of wild-type LuxS are obtained by 
replacing at least one amino acid residue in the sequence of the wild-type LuxS with a different 
amino acid residue, or by adding or deleting one or more amino acid residues within the wild- 
type sequence and/or at the N- and/or C-terminus of the wild-type LuxS. Preferably, such 
mutants will crystallize under crystallization conditions that are substantially similar to those 
used to crystallize the wild-type LuxS. 

The types of mutants contemplated by this invention include conservative mutants, non- 
conservative mutants, deletion mutants, truncated mutants, extended mutants, methionine 



261295-1 



22 



mutants, selenomethionine mutants, cysteine mutants and selenocysteine mutants. A mutant 
may have, but need not have, LuxS activity. Preferably, a mutant displays biological activity 
that is substantially similar to a biological activity of the wild-type polypeptide. Methionine, 
selenomethione, cysteine, and selenocysteine mutants are particularly useful mutants, as they 
may be used to produce heavy-atom derivative crystals, as described in detail, below. 

It will be recognized by one of skill in the art that the types of mutants contemplated 
herein are not mutually exclusive; that is, for example, a polypeptide having a conservative 
mutation in one amino acid may in addition have a truncation of residues at the N-terminus, 
and several Leu or He -> Met mutations. 

Sequence alignments of polypeptides in a LuxS family or of homologous polypeptide 
domains can be used to identify potential amino acid residues in the polypeptide sequence that 
are candidates for mutation (see FIG. 1). Identifying mutations that do not significantly 
interfere with the three-dimensional structure of LuxS and/or that do not deleteriously affect, 
and that may even enhance, a biological activity of LuxS will depend, in part, on the region 
where the mutation occurs. In highly variable regions of the molecule, such as those shown in 
FIG. 9A, non-conservative substitutions as well as conservative substitutions may be tolerated 
without significantly disrupting the three-dimensional structure and/or biological activity of the 
molecule. In highly conserved regions, or regions containing significant secondary structure, 
such as those regions shown in FIG. 9B, conservative amino acid substitutions are preferred. 

Conservative amino acid substitutions are well-known in the art, and include 
substitutions made on the basis of a similarity in polarity, charge, solubility, hydrophobicity 
and/or the hydrophilicity of the amino acid residues involved. Typical conservative 
substitutions are those in which a wild type amino acid is substituted with a different amino 
acid that is a member of the same class or category, as those classes are defined herein. Thus, 
typical conservative substitutions include aromatic to aromatic, apolar to apolar, aliphatic to 
aliphatic, acidic to acidic, basic to basic, polar to polar, etc. Other conservative amino acid 
substitutions are well known in the art. It will be recognized by those of skill in the art that 
generally, a total of about 20% or fewer, typically about 10% or fewer, most usually about 
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5% or fewer, of the amino acids in the wild-type polypeptide sequence can be conservatively 
substituted with other amino acids without deleteriously affecting the biological activity and/or 
three-dimensional structure of the molecule, provided that such substitutions do not involve 
residues that are critical for activity, as discussed above. 
5 Inspection of the various LuxS structures of the invention reveals that a metal binidng 

site is a likely candidate for being at least in part responsible for the known enzymatic activity 
of the wtLuxS polypeptide. The residues involved in the metal binding are His 57, His 61, 
Cys 131, and His 137 ("metal binding site"). EXAFS data on LuxS crystals clearly indicate 
this metal is a zinc atom or ion. Nearby, on the same face of the LuxS polypepetide, is bound a 
10 methionine molecule. This methionine is specifically held by its backbone atoms through 
hydrogen bonds with residues Arg 68, Asp 80, and He 81 ("amino acid binding site"). The 
sidechain of the bound methionine points toward the metal binding site, an arrangement that is 
y3 common for substrates of metalloenzymes thereby indicating that the bound methionine might 
m be bound at a substrate binding site of LuxS (see FIG. 1 1). The metal and amino acid binding 
S sites both lay at the interface of the homodimer, indicating the biological relevance of the 
W homodimer in the function of LuxS. 

Methionine is not a likely substrate of LuxS. However, modeling of a potential 
substrate of LuxS into the site occupied by the bound methionine residue indicated a potential 
y active site of the enzyme. The potential substrate, S-ribosylhomocysteine contacted several 
gp residues of the LuxS polypeptide including Ser 9, His 14, Arg 23 , Asp 40, Arg 42, Glu 60, Met 
84, Cyc 86, Thr 88, and Tyr 91 ("active site"). The active site of LuxS is described in detail 
below. In addition, all four LuxS polypeptides crystallized as dimers. The structures of LuxS 
revealed that several residues contribute to the dimerization interface including His 14, Pro 46, 
Asn 47, residue 50 (usually a hydrophobic residue), His 57, Glu 60, Cys 86, residue 87 (usually 
25 Arg), residue 128 (usually Glu), residue 129 (usually hydrophobic), Gly 132, residue 134 (usually 
hydrophobic) and residue (usually hydrophobic). These residues comprise the dimerization 
interface of LuxS ("dimerization interface"). Those of skill in the art will recognize that each 
region identified in a LuxS polypeptide can also be identified in other LuxS polypeptides with 
one or more conservative or non-conservative mutations at any residue of the region. 
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In some embodiments, it may be desirable to make mutations in the metal binding site, 
the amino acid binding site, the active site or the dimerization interface of a LuxS, e.g., to 
reduce or completely eliminate LuxS activity. For example, it may be desirable to mutate 
important residues in one or more of these sites of LuxS including, for example, His 57, His 
5 61, Cys 131, His 137, Arg 68, Asp 80 or lie 81. Other mutations that will reduce or 

completely eliminate the activity of a particular LuxS will be apparent to those of skill in the 
art. 

While in most instances the amino acids of wtLuxS will be substituted with genetically- 
encoded amino acids, in certain circumstances mutants may include genetically non-encoded 
10 amino acids. For example, non-encoded derivatives of certain encoded amino acids, such as 

SeMet and/or SeCys, may be incorporated into the polypeptide chain using biological 
S9 . expression systems (such SeMet and SeCys mutants are described in more detail, infra). 
CJ Alternatively, in instances where the mutant will be prepared in whole or in part by 

fyj chemical synthesis, virtually any non-encoded amino acids may be used, ranging from D- 
|| isomers of the genetically encoded amino acids to non-encoded naturally-occurring natural and 
w synthetic amino acids. 

s Conservative amino acid substitutions for many of the commonly known non- 

Ui genetically encoded amino acids are well known in the art. Conservative substitutions for 
^ other non-encoded amino acids can be determined based on their physical properties as 
81 compared to the properties of the genetically encoded amino acids. 

In some instances, it may be particularly advantageous or convenient to substitute, 
delete from and/or add residues to wtLuxS in order to provide convenient cloning sites in 
cDNA encoding the polypeptide, to aid in purification of the polypeptide, etc. Such 
substitutions, deletions and/or additions that do not substantially alter the three dimensional 
25 structure of the wtLuxS will be apparent to those having skills in the art. These substitutions, 
deletions and/or additions include, but are not limited to, His tags, intein-containing self- 
cleaving tags, fusions of wtLuxS with other peptides, proteins, polypeptides or proteins such as 
maltose binding protein, glutathione S-transferase, antibodies, green fluorescent proteins, 
signal peptides, biotin accepting peptides, and the like. For instance, the LuxS polypeptides 

261295-1 25 



whose structures were determined as described in the Examples below possessed C-terminal 
His-tags (i.e. a Gly-Ser-His-His-His-His-His-His sequence added recombinantly to the C- 
terminal ends of the sequences in FIG. 1). 

Mutations may also be introduced into a polypeptide sequence where there are residues, 
5 e.g. , cysteine residues, that interfere with crystallization. Such cysteine residues can be 

substituted with an appropriate amino acid that does not readily form covalent bonds with other 
amino acid residues under crystallization conditions; e.g. , by substituting the cysteine with 
Ala, Ser or Gly. Any cysteine located in a non-helical or non-(3-stranded segment, based on 
secondary structure assignments, are good candidates for replacement. There are only two 
10 cysteine residues in the LuxS polypeptide crystallized in this invention and one, and possibly 
both are involved in the putative active site of this enzyme. Mutation of one or both of these 
residues would likely not effect the fold (structure) of this polypeptide but could likely effect 
CI an activity of the protein. Conservative mutation of such cysteine residues are preferred where 
rll active LuxS polypeptides are desired. Non-conservative mutation of such cysteine residues are 
ft preferable when a biological activity of LuxS, such as metal ion binding, can be reduced or 
W eliminated. Such non-conservative mutants can be used to, for example, study the allosteric 
% effects of metal ion binding on LuxS. 

f i § It should be noted that the mutants contemplated herein need not exhibit LuxS activity. 

y Indeed, amino acid substitutions, additions or deletions that interfere with the activity of LuxS 

ib are specifically contemplated by the invention. Such crystalline polypeptides, or the atomic 
structure coordinates obtained therefrom, can be used to provide phase information to aid the 
determination of the three-dimensional X-ray structures of other related or non-related 
crystalline polypeptides. 

The heavy-atom derivative crystals from which the atomic structure coordinates of the 

2 5 invention are obtained generally comprise a crystalline LuxS polypeptide in association with 
one or more heavy metal atoms. The polypeptide may correspond to a wild-type or a mutant 
LuxS, which may optionally be in co-complex with one or more molecules, as previously 
described. There are two types of heavy-atom derivatives of polypeptides. In one form, 
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heavy-atom derivatives result from exposure of the LuxS to a heavy metal in solution, wherein 
crystals are grown in medium comprising the heavy metal, or in crystalline form, wherein the 
heavy metal diffuses into the crystal. In another form, heavy-atom derivatives comprise 
polypeptides having heavy-atom containing amino acids, e.g., selenomethionine and/or 
5 selenocysteine mutants. 

In practice, heavy-atom derivatives of the first type can be formed by soaking a native 
crystal in a solution comprising heavy metal atom salts, or organometallic compounds, e.g., 
lead chloride, gold thiomalate, ethylmercurithiosalicylic acid-sodium salt (thimerosal), uranyl 
acetate, platinum tetrachloride, osmium tetraoxide, zinc sulfate, and cobalt hexamine, which 
10 can diffuse through the crystal and bind to the crystalline polypeptide. 

Heavy-atom derivatives of this type can also be formed by adding to a crystallization 
^ solution comprising the polypeptide to be crystallized an amount of a heavy metal atom salt, 
^3 which may associate with the LuxS and be incorporated into the crystal. The location(s) of the 
f LI bound heavy metal atom(s) can be determined by X-ray diffraction analysis of the crystal. 
IS This information, in turn, is used to generate the phase information needed to construct the 
if three-dimensional structure of the crystalline LuxS. 

s Heavy-atom derivative crystals may also be prepared from polypeptides that include 

riii one or more SeMet and/or SeCys residues (SeMet and/or SeCys mutants). Such selenocysteine 
y or selenomethionine mutants may be made from wild-type or mutant LuxS by expression of 
6b LuxS-encoding cDNAs in auxotrophic E. coli strains (Hendrickson et al , 1990, EMBO J. 
9(5): 1665-1672). In this method, the wild-type or mutant LuxS cDNA may be expressed in a 
host organism on a growth medium depleted of either natural cysteine or methionine (or both) 
but enriched in selenocysteine or selenomethionine (or both). Alternatively, selenocysteine or 
selenomethionine mutants may be made using nonauxotrophic E. coli strains, e.g. , by 
25 inhibiting methionine biosynthesis in these strains with high concentrations of lie, Lys, Phe, 
Leu, Val or Thr and then providing selenomethionine in the medium (Doublie, 1997, Methods 
in Enzymology 276:523-530). Furthermore, selenocysteine can be selectively incorporated 
into polypeptides by exploiting the prokaryotic and eukaryotic mechanisms for selenocysteine 
incorporation into certain classes of LuxSs in vivo, as described in U.S. Patent No. 5,700,660 
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to Leonard et al (filed June 7, 1995). One of skill in the art will recognize that selenocysteine 
is preferably not incorporated in place of cysteine residues that form disulfide bridges, as these 
may be important for maintaining the three-dimensional structure of the LuxS and are 
preferably not to be eliminated. One of skill in the art will further recognize that, in order to 
5 obtain accurate phase information, approximately one selenium atom should be incorporated 
for every 140 amino acid residues of the polypeptide chain. The number of selenium atoms 
incorporated into the polypeptide chain can be conveniently controlled by designing a Met or 
Cys mutant having an appropriate number of Met and/or Cys residues, as described more fully 
below. 

10 In some instances, the polypeptide to be crystallized may not contain cysteine or 

methionine residues. Therefore, if selenomethionine and/or selenocysteine mutants are to be 

^ used to obtain heavy-atom derivative crystals, methionine and/or cysteine residues may be 
introduced into the polypeptide chain. Likewise, Cys residues must be introduced into the 

f LI polypeptide chain if the use of a cysteine-binding heavy metal, such as mercury, is 

|fe contemplated for production of a heavy-atom derivative crystal. 

W Such mutations are preferably introduced into the polypeptide sequence at sites that will 

not disturb the overall LuxS fold. For example, a residue that is conserved among many 
? 1 1 members of the LuxS family or that is thought to be involved in maintaining its activity or 
5 structural integrity, as determined by, e.g. , sequence alignments, should not be mutated to a 
Sb Met or Cys. In addition, conservative mutations, such as Ser to Cys, or Leu or He to Met, are 
preferably introduced. One additional consideration is that, in order for a heavy-atom 
derivative crystal to provide phase information for structure determination, the location of the 
heavy atom(s) in the crystal unit cell must be determinable and provide phase information. 
Therefore, a mutation is preferably not introduced into a portion of the LuxS that is likely to 
25 be mobile, e.g. , at, or within about 1-5 residues of, the N- and C-termini. 

Conversely, if there are too many methionine and/or cysteine residues in a polypeptide 
sequence, over-incorporation of the selenium-containing side chains can lead to the inability of 
the polypeptide to fold and/or crystallize, and may potentially lead to complications in solving 
the crystal structure. In this case, methionine and/or cysteine mutants are prepared by 
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substituting one or more of these Met and/or Cys residues with another residue. The 
considerations for these substitutions are the same as those discussed above for mutations that 
introduce methionine and/or cysteine residues into the polypeptide. Specifically, the Met 
and/or Cys residues are preferably conservatively substituted with Leu/Ile and Ser, 
5 respectively. 

As DNA encoding cysteine and methionine mutants can be used in the methods 
described above for obtaining SeCys and SeMet heavy-atom derivative crystals, the preferred 
Cys or Met mutant will have one Cys or Met residue for every 140 amino acids. 

10 6.4 Production of Polypeptides 

The native and mutated LuxS polypeptides described herein may be chemically 
n synthesized in whole or part using techniques that are well-known in the art (see, e.g. , 
O Creighton, 1983, Proteins: Structures and Molecular Principles, W.H. Freeman & Co., NY.), 
nj Alternatively, methods that are well known to those skilled in the art can be used to construct 
IS expression vectors containing the native or mutated LuxS polypeptide coding sequence and 
*i appropriate transcriptional/translational control signals. These methods include in vitro 
s recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic 
hi recombination. See, for example, the techniques described in Maniatis et al. , 1989, Molecular 
IS Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY and Ausubel et al. , 
fib 1989, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, NY. 

A variety of host-expression vector systems may be utilized to express the LuxS coding 
sequence. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
25 containing the LuxS coding sequence; yeast transformed with recombinant yeast expression 
vectors containing the LuxS coding sequence; insect cell systems infected with recombinant 
virus expression vectors {e.g., baculovirus) containing the LuxS coding sequence; plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
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vectors (e.g., Ti plasmid) containing the LuxS coding sequence; or animal cell systems. The 
expression elements of these systems vary in their strengths and specificities. 

Specifically designed vectors allow the shuttling of DNA between hosts such as 
bacteria-yeast or bacteria-animal cells. An appropriately constructed expression vector may 
include: an origin of replication for autonomous replication in host cells, selectable markers, a 
limited number of useful restriction enzyme sites, a potential for high copy number, and active 
promoters. A promoter is defined as a DNA sequence that directs RNA polymerase to bind to 
DNA and initiate RNA synthesis. A strong promoter is one that causes mRNAs to be initiated 
at high frequency. 

Depending on the host/vector system utilized, any of a number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used in the 
expression vector. For example, when cloning in bacterial systems, inducible promoters such 
as the T7 promoter, pL of bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the 
like may be used; when cloning in insect cell systems, promoters such as the baculovirus 
polyhedrin promoter may be used; when cloning in plant cell systems, promoters derived from 
the genome of plant cells (e.g. , heat shock promoters; the promoter for the small subunit of 
RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant viruses (e.g. , 
the 35S RNA promoter of CaMV; the coat protein promoter of TMV) may be used; when 
cloning in mammalian cell systems, promoters derived from the genome of mammalian cells 
(e.g. , metallothionein promoter) or from mammalian viruses (e.g. , the adenovirus late 
promoter; the vaccinia virus 7.5K promoter) may be used; when generating cell lines that 
contain multiple copies of the tyrosine kinase domain DNA, SV40-, BPV- and EBV-based 
vectors may be used with an appropriate selectable marker. 

The expression vector may be introduced into host cells via any one of a number of 
techniques including but not limited to transformation, transfection, infection, protoplast 
fusion, and electroporation. The expression vector-containing cells are clonally propagated 
and individually analyzed to determine whether they produce LuxS. Identification of LuxS 
expressing host cell clones may be done by several means, including but not limited to 
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immunological reactivity with anti-LuxS antibodies, and the presence of host cell-associated 
LuxS activity. 

Expression of LuxS cDNA may also be performed using in vitro produced synthetic 
mRNA. Synthetic mRNA can be efficiently translated in various cell-free systems, including 
but not limited to wheat germ extracts and reticulocyte extracts, as well as efficiently translated 
in cell based systems, including but not limited to microinjection into frog oocytes. 

To determine the LuxS cDNA sequence(s) that yields optimal levels of LuxS activity 
and/or protein LuxS, modified LuxS cDNA molecules are constructed. Host cells are 
transformed with the cDNA molecules and the levels of LuxS RNA and/or LuxS are measured. 

Levels of LuxS in host cells are quantitated by a variety of methods such as 
immunoaffinity and/or ligand affinity techniques, LuxS-specific affinity beads or LuxS-specific 
antibodies are used to isolate 35 S-methionine labeled or unlabeled LuxS. Labeled or unlabeled 
LuxS is analyzed by SDS-PAGE. Unlabeled LuxS is detected by Western blotting, ELISA or 
RIA employing LuxS-specific antibodies. LuxS-specific antibodies can be obtained by 
techniques well known to those of skill in the art including, for instance, those disclosed in 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. 

Following expression of LuxS in a recombinant host cell, LuxS may be recovered to 
provide LuxS in active form. Several protein purification procedures are available and suitable 
for use. Recombinant LuxS may be purified from cell ly sates or from conditioned culture 
media, by various combinations of, or individual application of, fractionation, or 
chromatography steps that are known in the art. 

In addition, recombinant LuxS can be separated from other cellular proteins by use of 
an immuno-affinity column made with monoclonal or polyclonal antibodies specific for full 
length nascent LuxS or polypeptide fragments thereof. 

Alternatively, LuxS may be recovered from a host cell in an unfolded, inactive form, 
e.g. , from inclusion bodies of bacteria. LuxSs recovered in this form may be solublized using 
a denaturant, e.g. , guanidinium hydrochloride, and then refolded into an active form using 
methods known to those skilled in the art, such as dialysis. 
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6.5 Crystallization Of LuxS Polypeptides And Characterization Of Crystals 

The native, heavy-atom derivative and co-crystals from which the atomic structure 
coordinates of the invention are obtained can be obtained by conventional means as are well- 
5 known in the art of protein crystallography, including batch, liquid bridge, dialysis, and vapor 
diffusion methods (see, e.g., McPherson, 1998, Crystallization of Biological Macromolecules, 
Cold Spring Harbor Press, New York; McPherson, 1990, Eur. J. Biochem. 189:1-23.; Weber, 
1991, Adv. Protein Chem. 41:1-36.). 

Generally, native crystals are grown by dissolving substantially pure LuxS polypeptide 
10 in an aqueous buffer comprising a precipitant at a concentration just below that necessary to 
precipitate the LuxS. Examples of precipitants include, but are not limited to, polyethylene 
glycol, ammonium sulfate, 2-methyl-2,4-pentanediol, sodium citrate, sodium chloride, 
O glycerol, isopropanol, lithium sulfate, sodium acetate, sodium formate, potassium sodium 
Q tartrate, ethanol, hexanediol, ethylene glycol, dioxane, t-butanol and combinations thereof. 
!jp Water is removed by controlled evaporation to produce precipitating conditions, which are 
'fi maintained until crystal growth ceases. 

M In a preferred embodiment, native crystals are grown by vapor diffusion in sitting drops 

U (McPherson, 1982, Preparation and Analysis of Protein Crystals, John Wiley, New York; 
J| McPherson, 1990, Eur. J. Biochem. 189:1-23.). In this method, the polypeptide/precipitant 
§p solution is allowed to equilibrate in a closed container with a larger aqueous reservoir having a 
O precipitant concentration optimal for producing crystals. Generally, less than about 25 jjJL of 
substantially pure polypeptide solution is mixed with an equal volume of reservoir solution, 
giving a precipitant concentration about half that required for crystallization. The sealed 
container is allowed to stand, usually for about 2-6 weeks, until crystals grow. 
25 For native crystals from which the atomic structure coordinates of the invention are 

obtained, it has been found that sitting drops comprising about 1 jaL ofH. pylori LuxS 
polypeptide (5 mg/mL in 10 mM HEPES, pH 7.5, 150 mM sodium chloride, 10 mM 
methionine, 1 mM beta-mercaptoethanol) with 1 |uL reservoir solution (32% w/v PEG 1000, 
200 mM ammonium sulfate, and 100 mM MES, pH 5.75) suspended over 0.5 mL reservoir 
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solution for about one week at 20°C provide diffraction quality crystals. Similarly, sitting 
drops prepared by mixing about 1 pL of D. radiodurans LuxS polypeptide (19 mg/mL in 10 
mM HEPES, pH 7.5, 150 mM sodium chloride, 10 mM methionine, 1 mM beta- 
mercaptoethanol) and 1 jiL reservoir solution (26% w/v PEG monomethyl ether ("PEG 
MME") 5000, and 100 mM MES, pH 6.5) suspended over 0.5 mL reservoir solution for about 
one week at 4°C provide diffraction quality crystals. Sitting drops prepared by mixing about 
1 nL of H. influenzae LuxS polypeptide (10 mg/mL in 10 mM HEPES, pH 7.5, 150 mM 
sodium chloride, 10 mM methionine, 1 mM beta-mercaptoethanol) and 1 \\L reservoir solution 
(21% w/v PEG MME 5000, and 100 mM Bis-Tris, pH 6.25) suspended over 0.5 mL reservoir 
solution for about one week at 12°C provide diffraction quality crystals. 

Of course, those having skill in the art will recognize that the above-described 
crystallization conditions can be varied. Exemplary variations, which may be used alone or in 
combination include polypeptide solutions comprising polypeptide concentrations between 
about 3 mg/mL and about 25 mg/mL, buffer concentrations between about 5 mM and about 
200 mM, sodium chloride concentrations between about 0 mM and about 400 mM, pH ranges 
between about 5.0 and about 7.0; and reservoir solutions comprising PEG or PEG MME 
concentrations between about 15% and about 35% (w/v), PEG or PEG MME average 
molecular weights between about 600 and about 10000, ammonium sulfate concentrations 
between about 0 mM and about 300 mM, and temperature ranges between 4° C and 25°C. 
Any buffer solution capable of maintaining the desired pH range may be used, including, for 
example, ACES, ADA, BES, Bis-Tris Propane, Citric Acid, Imidazole, MOPS, PIPES, 
HEPES, MES, Tris, Bis-Tris and cacodylate. 

Heavy-atom derivative crystals can be obtained by soaking native crystals in mother 
liquor containing salts of heavy metal atoms. Native crystals can be soaked with salts of heavy 
metal atoms according to methods known to those of skill in the art including, for instance, 
those disclosed in Stura and Chen, 1992, "Soaking of Crystals" in Crystallization of Nucleic 
Acids and Proteins: A Practical Approach, Ducruix and Giege eds., Oxford University Press. 
Exemplary mother liquor solutions include the sitting drop solutions described in detail above. 
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Heavy-atom derivative crystals can also be obtained from SeMet and/or SeCys mutants, 
as described above for native crystals. 

Some mutant LuxSs may crystallize under slightly different crystallization conditions 
than wild-type LuxS, or under very different crystallization conditions, depending on the 
nature of the mutation, and its location in the LuxS. For example, a non-conservative mutation 
may result in alteration of the hydrophilicity of the mutant, which may in turn make the mutant 
LuxS either more soluble or less soluble than the wild-type LuxS. Typically, if a LuxS 
becomes more hydrophilic as a result of a mutation, it will be more soluble than the wild-type 
LuxS in an aqueous solution and a higher precipitant concentration will be needed to cause it to 
crystallize. Conversely, if a LuxS becomes less hydrophilic as a result of a mutation, it will be 
less soluble in an aqueous solution and a lower precipitant concentration will be needed to 
cause it to crystallize. If the mutation happens to be in a region of the LuxS involved in crystal 
lattice contacts, crystallization conditions may be affected in more unpredictable ways. 

6.6 Characterization of LuxS Crystals 

The dimensions of a unit cell of a crystal are defined by six numbers, the lengths of 
three unique edges, a, b, and c, and three unique angles, a, p, and y. The type of unit cell that 
comprises a crystal is dependent on the values of these variables, as discussed above in Section 
3.2. 

When a crystal is placed in an X-ray beam, the incident X-rays interact with the 
electron cloud of the molecules that make up the crystal, resulting in X-ray scatter. The 
combination of X-ray scatter with the lattice of the crystal gives rise to nonuniformity of the 
scatter; areas of high intensity are called diffracted X-rays. The angle at which diffracted 
beams emerge from the crystal can be computed by treating diffraction as if it were reflection 
from sets of equivalent, parallel planes of atoms in a crystal (Bragg's Law). The most obvious 
sets of planes in a crystal lattice are those that are parallel to the faces of the unit cell. These 
and other sets of planes can be drawn through the lattice points. Each set of planes is 
identified by three indices, hkl. The h index gives the number of parts into which the a edge 
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of the unit cell is cut, the k index gives the number of parts into which the b edge of the unit 
cell is cut, and the 1 index gives the number of parts into which the c edge of the unit cell is cut 
by the set of hkl planes. Thus, for example, the 235 planes cut the a edge of each unit cell into 
halves, the b edge of each unit cell into thirds, and the c edge of each unit cell into fifths. 
5 Planes that are parallel to the be face of the unit cell are the 100 planes; planes that are parallel 
to the ac face of the unit cell are the 010 planes; and planes that are parallel to the ab face of 
the unit cell are the 001 planes. 

When a detector is placed in the path of the diffracted X-rays, in effect cutting into the 
sphere of diffraction, a series of spots, or reflections, are recorded to produce a "still" 

10 diffraction pattern. Each reflection is the result of X-rays reflecting off one set of parallel 

planes, and is characterized by an intensity, which is related to the distribution of molecules in 

pn the unit cell, and hkl indices, which correspond to the parallel planes from which the beam 
producing that spot was reflected. If the crystal is rotated about an axis perpendicular to the 

fU X-ray beam, a large number of reflections is recorded on the detector, resulting in a diffraction 

f§5 pattern as shown in FIG. 2. 

!:JJ The unit cell dimensions and space group of a crystal can be determined from its 

* diffraction pattern. First, the spacing of reflections is inversely proportional to the lengths of 
111 the edges of the unit cell. Therefore, if a diffraction pattern is recorded when the X-ray beam 
*S is perpendicular to a face of the unit cell, two of the unit cell dimensions may be deduced from 
flo the spacing of the reflections in the x and y directions of the detector, the crystal-to-detector 
distance, and the wavelength of the X-rays. Those of skill in the art will appreciate that, in 
order to obtain all three unit cell dimensions, the crystal must be rotated such that the X-ray 
beam is perpendicular to another face of the unit cell. Second, the angles of a unit cell can be 
determined by the angles between lines of spots on the diffraction pattern. Third, the absence 
25 of certain reflections and the repetitive nature of the diffraction pattern, which may be evident 
by visual inspection, indicate the internal symmetry, or space group, of the crystal. Therefore, 
a crystal may be characterized by its unit cell and space group, as well as by its diffraction 
pattern. 
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Once the dimensions of the unit cell are determined, the likely number of polypeptides 
in the asymmetric unit can be deduced from the size of the polypeptide, the density of the 
average LuxS, and the typical solvent content of a LuxS crystal, which is usually in the range 
of 30-70% of the unit cell volume (Matthews, 1968, J. Mol. Biol. 33(2):491-497). 
5 Several SeMet mLuxS crystals were obtained from different bacterial sources. The 

characteristics of these crystals are described below. The H. pylori LuxS crystals of the 
present invention are generally characterized by a diffraction pattern, as shown in FIG. 2. The 
crystals are further characterized by unit cell dimensions and space group symmetry 
information obtained from the diffraction patterns, as described above. The crystals, which 
10 may be native crystals, heavy-atom derivative crystals or co-crystals, have a tetragonal unit 

cell and space group symmetry P432i2. In one form of crystalline H. pylori LuxS, the unit cell 
„ has dimensions of a=71.04 +/- 0.7 A, b=71.04 +/- 0.7 A, c=130.14 +/- 1.3 A. There are 
W likely to be two LuxS molecules in the asymmetric unit, related by an approximate 2-fold axis, 
fyj The crystals appear as long (up to 0.7 mm), thin (typically 0.05 to 0.1 mm wide) spikes. 
fSs LuxS crystals were also obtained from H. influenzae LuxS. The H. influenzaei LuxS 

If crystals, which may be native crystals, heavy-atom derivative crystals or co-crystals, have a 
* tetragonal unit cell and space group symmetry P422i2. In one form of crystalline H. influenzae 
ryi LuxS, the unit cell has dimensions of a=129.59 +/- 1.3 A, b=129.59 +/- 1.3 A, c=53.74 
% +/- 0.5 A. There are likely to be two LuxS molecules in the asymmetric unit, related by an 
So approximate 2-fold axis. The crystals appear as long (up to 0.4 mm), thin (typically 0.05 to 0.1 
mm wide) rods. 

LuxS crystals of the invention were also obtained from D. radiodurans LuxS. In one 
form, the D. radiodurans LuxS crystals, which may be native crystals, heavy-atom derivative 
crystals or co-crystals, have a monoclinic unit cell and space group symmetry P2i. The unit 
25 cell has dimensions of a=43.71 +/- 0.4 A, b=82.18 +/- 0.8 A, c=49.48 +/- 0.5 A and beta 
= 102.78 +/- 1.0 degrees. There are likely to be two LuxS molecules in the asymmetric unit, 
related by an approximate 2-fold axis. The crystals appear as small blocks (typically 0.05 to .1 
mm on a side). In another form of D. radiodurans LuxS crystals, a C2 monoclinic unit cell is 
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observed with dimensions of a=51.19 +/- 0.5 A, b=70.14 +/- 0.7 A, c=49.73 +/- 0.5 A 
and beta = 112.03 +/- 1.1 degree. There is one molecule of LuxS in the asymmetric unit. 

6.7 Collection of Data and Determination of Structure Solutions 
5 The diffraction pattern is related to the three-dimensional shape of the molecule by a 

Fourier transform. The process of determining the solution is in essence a re-focusing of the 
diffracted X-rays to produce a three-dimensional image of the molecule in the crystal. Since 
re-focusing of X-rays cannot be done with a lens at this time, it is done via mathematical 
operations. 

10 The sphere of diffraction has symmetry that depends on the internal symmetry of the 

crystal, which means that certain orientations of the crystal will produce the same set of 
r| reflections. Thus, a crystal with high symmetry has a more repetitive diffraction pattern, and 
^ there are fewer unique reflections that need to be recorded in order to have a complete 
f LJ representation of the diffraction. The goal of data collection, a dataset, is a set of consistently 
f|5 measured, indexed intensities for as many reflections as possible. A complete dataset is 

collected if at least 80% , preferably at least 90% , most preferably at least 95 % of unique 
: reflections are recorded. In one embodiment, a complete dataset is collected using one crystal, 
fil In another embodiment, a complete dataset is collected using more than one crystal of the 
1^ same type. 

2| o Sources of X-rays include, but are not limited to, a rotating anode X-ray generator such 

as a Rigaku RU-200 or a beamline at a synchrotron light source, such as the Advanced Photon 
Source at Argonne National Laboratory. Suitable detectors for recording diffraction patterns 
include, but are not limited to, X-ray sensitive film, multiwire area detectors, image plates 
coated with phosphorus, and CCD cameras. Typically, the detector and the X-ray beam 

25 remain stationary, so that, in order to record diffraction from different parts of the crystal's 
sphere of diffraction, the crystal itself is moved via an automated system of moveable circles 
called a goniostat. 

One of the biggest problems in data collection, particularly from macromolecular 
crystals having a high solvent content, is the rapid degradation of the crystal in the X-ray 
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beam. In order to slow the degradation, data is often collected from a crystal at liquid nitrogen 
temperatures. In order for a crystal to survive the initial exposure to liquid nitrogen, the 
formation of ice within the crystal must be prevented by the use of a cryoprotectant. Suitable 
cryoprotectants include, but are not limited to, low molecular weight polyethylene glycols, 
ethylene glycol, sucrose, glycerol, xylitol, and combinations thereof. Crystals may be soaked 
in a solution comprising the one or more cryoprotectants prior to exposure to liquid nitrogen, 
or the one or more cryoprotectants may be added to the crystallization solution. Data 
collection at liquid nitrogen temperatures may allow the collection of an entire dataset from one 
crystal. 

Once a dataset is collected, the information is used to determine the three-dimensional 
structure of the molecule in the crystal. However, this cannot be done from a single 
measurement of reflection intensities because certain information, known as phase information, 
is lost between the three-dimensional shape of the molecule and its Fourier transform, the 
diffraction pattern. This phase information must be acquired by methods described below in 
order to perform a Fourier transform on the diffraction pattern to obtain the three-dimensional 
structure of the molecule in the crystal. It is the determination of phase information that in 
effect refocuses X-rays to produce the image of the molecule. 

One method of obtaining phase information is by isomorphous replacement, in which 
heavy-atom derivative crystals are used. In this method, the positions of heavy atoms bound to 
the molecules in the heavy-atom derivative crystal are determined, and this information is then 
used to obtain the phase information necessary to elucidate the three-dimensional structure of a 
native crystal. (Blundel et aL, 1976, Protein Crystallography, Academic Press). 

Another method of obtaining phase information is by molecular replacement, which is a 
method of calculating initial phases for a new crystal of a polypeptide whose structure 
coordinates are unknown by orienting and positioning a polypeptide whose structure 
coordinates are known within the unit cell of the new crystal so as to best account for the 
observed diffraction pattern of the new crystal. Phases are then calculated from the oriented 
and positioned polypeptide and combined with observed amplitudes to provide an approximate 
Fourier synthesis of the structure of the molecules comprising the new crystal. (Lattman, 
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1985, Methods in Enzymology 115:55-77; Rossmann, 1972, "The Molecular Replacement 
Method," Int. Sci. Rev. Ser. No. 13, Gordon & Breach, New York). 

A third method of phase determination is multi- wavelength anomalous diffraction or 
MAD. In this method, X-ray diffraction data are collected at several different wavelengths 
from a single crystal containing at least one heavy atom with absorption edges near the energy 
of incoming X-ray radiation. The resonance between X-rays and electron orbitals leads to 
differences in X-ray scattering that permits the locations of the heavy atoms to be identified, 
which in turn provides phase information for a crystal of a polypeptide. A detailed discussion 
of MAD analysis can be found in Hendrickson, 1985, Trans. Am. Crystallogr. Assoc., 21:11; 
Hendrickson et a/., 1990, EMBO J. 9:1665; and Hendrickson, 1991, Science 4:91. 

A fourth method of determining phase information is single wavelength anomalous 
dispersion or SAD. In this technique, X-ray diffraction data are collected at a single 
wavelength from a single native or heavy-atom derivative crystal, and phase information is 
extracted using anomalous scattering information from atoms such as sulfur or chlorine in the 
native crystal or from the heavy atoms in the heavy-atom derivative crystal. The wavelength 
of X-rays used to collect data for this phasing technique need not be close to the absorption 
edge of the anomalous scatterer. A detailed discussion of SAD analysis can be found in 
Brodersen et al., 2000, Acta Cryst., D56:431-441. 

A fifth method of determining phase information is single isomorphous replacement 
with anomalous scattering or SIRAS. This technique combines isomorphous replacement and 
anomalous scattering techniques to provide phase information for a crystal of a polypeptide. 
X-ray diffraction data are collected at a single wavelength, usually from a single heavy-atom 
derivative crystal. Phase information obtained only from the location of the heavy atoms in a 
single heavy-atom derivative crystal leads to an ambiguity in the phase angle, which is resolved 
using anomalous scattering from the heavy atoms. Phase information is therefore extracted 
from both the location of the heavy atoms and from anomalous scattering of the heavy atoms. 
A detailed discussion of SIRAS analysis can be found in North, 1965, Acta Cryst. 18:212-216; 
Matthews, 1966, Acta Cryst. 20:82-86. 



261295-1 



39 



Once phase information is obtained, it is combined with the diffraction data to produce 
an electron density map, an image of the electron clouds that surround the molecules in the unit 
cell. The higher the resolution of the data, the more distinguishable are the features of the 
electron density map, e.g., amino acid side chains and the positions of carbonyl oxygen atoms 
5 in the peptide backbones, because atoms that are closer together are resolvable. A model of 
the macromolecule is then built into the electron density map with the aid of a computer, using 
as a guide all available information, such as the polypeptide sequence and the established rules 
of molecular structure and stereochemistry. Interpreting the electron density map is a process 
of finding the chemically reasonable conformation that fits the map precisely. 
10 After a model is generated, a structure is refined. Refinement is the process of 

minimizing the function O, which is the difference between observed and calculated intensity 
r% values (measured by an R-factor), and which is a function of the position, temperature factor, 
1% and occupancy of each non-hydrogen atom in the model. This usually involves alternate cycles 
f LI of real space refinement, i.e. , calculation of electron density maps and model building, and 
a 5 reciprocal space refinement, i.e. , computational attempts to improve the agreement between 
the original intensity data and intensity data generated from each successive model. 
Refinement ends when the function <D converges on a minimum wherein the model fits the 
fyi electron density map and is stereochemically and confbrmationally reasonable. During 
refinement, ordered solvent molecules are added to the structure. 

J|o 

6.7.1 Structures of LuxS 
The present invention provides, for the first time, the high-resolution three-dimensional 
structure and atomic structure coordinates of crystalline LuxS as determined by X-ray 
crystallography. The specific methods used to obtain the structure coordinates are provided in 
25 the examples, infra. The atomic structure coordinates of four crystalline forms of LuxS are 
appended as Table 7, Table 8, Table 9, and Table 10 (K pylori, K influenzae, D. radiodurans 
P2i, andZ). radiodurans C2, respectively). 

Those having skill in the art will recognize that atomic structure coordinates as 
determined by X-ray crystallography are not without error. Thus, it is to be understood that any 
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set of structure coordinates obtained for crystals of LuxS, whether native crystals, heavy-atom 
derivative crystals or co-crystals, that have a root mean square deviation ("r.m.s.d.") of less than 
or equal to about 2 A when superimposed, using backbone atoms (N, Ca, C and O), on the 
structure coordinates listed in Table 7, Table 8, Table 9, and Table 10, are considered to be 
identical with the structure coordinates listed when at least about 50% to 100% contiguous or 
non-contiguous Ca atoms of LuxS are included in the superposition. 

Referring now to FIG. 6, the overall structure of LuxS is of an "alpha-beta" fold, meaning 
one with approximately the same number of alpha helicies as beta strands. The various regions 
of the molecule are identified by residue in FIG. 1. For example, the designation "3333" in 
FIG. 1 indicates the residues of the 3/10 helix, discussed below. The four beta strands comprise 
an anti-parallel beta sheet, backed on one side by the three longest alpha helicies (#2, #3, and #4). 
Between the sheet and the helicies is the hydrophobic core of the polypeptide. A separate 3/10 
helix is also observed at the N-terminal side of the protein. A striking coordination of three 
histidine residues (His 57, His 61, and His 137) and a cysteine residues (Cys 131) indicate a 
portion of the LuxS active site. Three of these residues (His 57, His 61, and Cys 131) are within 
coordination distance (less than 2.5 A) of a metal ion obsereved in the experimental density map, 
which was determined to be zinc from EXAFS measurements on LuxS crystals (see FIG. 10A). 
This ion is clearly visible in the electron density map calculated from the phases obtained directly 
from the SHARP phasing program. As only three residues and not four are seen to coordinate 
with the metal it must have a lone pair of electrons available for whatever chemical processing 
the enzyme performs in its in vivo function. 

Examination of residual density after modeling of the LuxS protein revealed a patch of 
density near the metal binding site. This was successfully modeled to be a methionine (see FIG. 
1 1). This ligand was seen in both molecules in the asymmetric unit for K pylori and H. 
influenzae and in one of the molecules (B) mD. radiodurans, space group P2i. Methionine was 
present in the polypepide solution used for crystallization in lOmM concentration, added to keep 
Se atoms reduced. Thus there is no indication that methionine plays an in vivo role as a substrate 
for LuxS. Indeed, its sidechain is too short to reach the metal site (see FIG. 1 1). However, 
through modeling with the program SPOCK (Christopher, 1998, Texas A & M University) it is 
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apparent that there is considerable room for a larger amino acid to bind in this region. When S- 
ribosylhomocysteine, a proposed substrate for LuxS (PCT WO 00/321 52), was modeled into the 
amino acid binding site several highly conserved residues of LuxS were identified as significant 
due to their closed proximity to this ligand: Ser 9, His 14, Arg 23, Asp 40, Arg 42, Glu 60, Met 
5 84, Cyc 86, Thr 88, and Tyr 91 . 

The recognition of the amino acid ligand is through its backbone atoms. The carbonyl 
group is within hydrogen bonding distance of the backbone amino proton of residue 81 (2.8 A) 
and the amide group of the Arg68 sidechain (3.1 A). The amino group of the methionine ligand 
is within hydrogen bonding distance of residue 81 backbone carbonyl group (2.8 A), and the 
10 sidechain carbonyl group of Asp 80 (2.6 A). A water mediated hydrogen bond from the 

methionine carbonyl to the carbonyl of Glu 60 is also seen. Glu 60, Arg 68, and Asp 80 are 
^ highly conserved in the LuxS proteins (see FIG. 1) indicating that all possess a capacity to bind 
O an amino acid or derivative thereof 

f y Van der Waal contacts of the methionine are made with both LuxSmolecules in the 

Sf.5 homodimer (see below). The closest approaches to the methionine sidechain from nearby 
W residues sidechain atoms are: 3.3 A for Asp 80, 3.9 A for Glu 60, 3.8 A for Ala 64, and 3.8 A for 
* His 6 1 , all in the same molecule that is binding the methionine backbone; and 3 . 8 A for Tyr 9 1 , 
fi J 3.6 A for Ser 9, 3.3 A for Phe 10, and 3.5 A for residue 7, all from the other molecule in the 
O homodimer. The significance of these contacts are emphasized by the fact that all except residue 
Do 7 are highly conserved in the LuxS motif (see FIG. 1). 

" 3? Three crystalline LuxS polypeptides displayed a homodimer interaction in their 

asymmetric units, and the fourth, the D. radiodurans C2 crystalline polypeptide, displayed a 
dimer with crystallographic symmetry. The dimerization is illustrated in FIG. 7. This 
dimerization was highly consistent between the three structures (alpha carbon superpositions of 
25 the dimers ranging from 1 .0 A 2 for D. radiodurans P2i onto K influenzae to 1 .2 A 2 for D. 

radiodurans Y2\ onto K pylori for residues 1 1-69, 77-118, and 125-152 of each monomer). The 
surface area buried through this interaction is 3930 A 2 for the D. radiodurans P2i LuxS, 
3,195 A 2 for the D. radiodurans C2 crystallographic dimer, 4160 A 2 for K influenzae, and 4180 
A 2 for H. pylori. These are very significant, comprising nearly a quarter of each molecules 
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surface. Thus we propose that LuxS functions as a homodimer in solution. More evidence for 
this is the fact that the methionine ligand binding occurs at the dimer interface (see FIG. 7B) and 
a channel is provided for ligand entrance and exit through the opposing molecule (see FIG. 12). 
Yet more evidence is apparent from plots of electrostatic potential on the monomer 
5 surfaces (FIG. 13 A) and plots of residue conservation (FIG. 13B). It is clear from FIG. 13 A that 
there is a charge complementarity to the surfaces that interact to form the dimer. Also FIG. 13B 
demonstrate that the majority of conserved, non-hydrophobic residues line the dimerization 
domain, which is also the metal and amino acid binding domains, emphasizing its importance. 
Dynamic light scattering studies have given some support to the dimerization hypothesis. In 
10 particular, all four soluble proteins showed monodisperse dimer formation (apparent molecular 

weights of 1 .8 to 2.1 times that of monomer). 
- a5r A structure-based sequence alignment is shown in FIG. 1. The secondary structure 

O assignments for LuxS were obtained using the Kabsch and Sander algorithm (Kabsch and 
f ;j Sander, 1983, "Dictionary of LuxS secondary structure: pattern recognition of hydrogen-bonded 
Us and geometrical features," Biopolymers 22:2577-2637.) as implemented in PROCHECK 
W (Laskowski et ah, 1993, "PROCHECK: a program to check the stereochemical quality of LuxS 
s structures," J. Appl. Cryst. 26:283-291). In the LuxS sequences a hyphen denotes an insertion. 

y 6.8 Atomic Structure Coordinates of LuxS 

52o The atomic structure coordinates can be used in molecular modeling and design, as 

described more fully below. The present invention encompasses the structure coordinates and 
other information, e.g., amino acid sequence, connectivity tables, vector-based 
representations, temperature factors, etc., used to generate the three-dimensional structure of 
the polypeptide for use in the software programs described below and other software 
25 programs. 

The invention encompasses machine readable media embedded with the three- 
dimensional structure of the model described herein, or with portions thereof. As used herein, 
"machine readable medium" refers to any medium that can be read and accessed directly by a 
computer or scanner. Such media include, but are not limited to: magnetic storage media, 

261295-1 43 



• # 



such as floppy discs, hard disc storage medium and magnetic tape; optical storage media such 
as optical discs or CD-ROM; electrical storage media such as RAM or ROM; and hybrids of 
these categories such as magnetic/optical storage media. Such media further include paper on 
which is recorded a representation of the atomic structure coordinates, e.g., Cartesian 
5 coordinates, that can be read by a scanning device and converted into a three-dimensional 
structure with an OCR. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon the atomic structure coordinates of the 
invention or portions thereof and/or X-ray diffraction data. The choice of the data storage 
10 structure will generally be based on the means chosen to access the stored information. In 
addition, a variety of data processor programs and formats can be used to store the sequence 

O and X-ray data information on a computer readable medium. Such formats include, but are not 
limited to, LuxS Data Bank ("PDB") format (Research Collaboratory for Structural 

r f Bioinformatics; http://www.rcsb.Org/pdb/docs/format/pdbguide2.2/guide2.2_frame.html); 

vj5 Cambridge Crystallographic Data Centre format 

5 (http://www.ccdc.cam.ac.uk/sup Structure-data ("SD") file 

format (MDL Information Systems, Inc.; Dalby et a/., 1992, J. Chem. Inf. Comp. Sci. 

[1* 32:244-255), and line-notation, e.g., as used in SMILES (Weininger, 1988, J. Chem. Inf. 

I;! Comp. Sci. 28:31-36). Methods of converting between various formats read by different 

So computer software will be readily apparent to those of skill in the art, e.g. , BABEL (v. 1 .06, 
Walters & Stahl, ®1992, 1993, 1994; http://www.bruneLac.uk/departments/chem/babel.htm.) 
All format representations of the polypeptide coordinates described herein, or portions thereof, 
are contemplated by the present invention. By providing computer readable medium having 
stored thereon the atomic coordinates of the invention, one of skill in the art can routinely 
25 access the atomic coordinates of the invention, or portions thereof, and related information for 
use in modeling and design programs, described in detail below. 

While Cartesian coordinates are important and convenient representations of the three- 
dimensional structure of a polypeptide, those of skill in the art will readily recognize that other 
representations of the structure are also useful. Therefore, the three-dimensional structure of a 
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polypeptide, as discussed herein, includes not only the Cartesian coordinate representation, but 
also all alternative representations of the three-dimensional distribution of atoms. For 
example, atomic coordinates may be represented as a Z-matrix, wherein a first atom of the 
LuxS is chosen, a second atom is placed at a defined distance from the first atom, a third atom 
is placed at a defined distance from the second atom so that it makes a defined angle with the 
first atom. Each subsequent atom is placed at a defined distance from a previously placed atom 
with a specified angle with respect to the third atom, and at a specified torsion angle with 
respect to a fourth atom. Atomic coordinates may also be represented as a Patterson function, 
wherein all interatomic vectors are drawn and are then placed with their tails at the origin. 
This representation is particularly useful for locating heavy atoms in a unit cell. In addition, 
atomic coordinates may be represented as a series of vectors having magnitude and direction 
and drawn from a chosen origin to each atom in the polypeptide structure. Furthermore, the 
positions of atoms in a three-dimensional structure may be represented as fractions of the unit 
cell (fractional coordinates), or in spherical polar coordinates. 

Additional information, such as thermal parameters, which measure the motion of each 
atom in the structure, chain identifiers, which identify the particular chain of a multi-chain 
LuxS in which an atom is located, and connectivity information, which indicates to which 
atoms a particular atom is bonded, is also useful for representing a three-dimensional 
molecular structure. 

6.9 Uses of the Atomic Structure Coordinates 

Structure information, typically in the form of the atomic structure coordinates, can be 
used in a variety of computational or computer-based methods to, for example, design, screen 
for and/or identify compounds that bind the crystallized polypeptide or a portion or fragment 
thereof, or to intelligently design mutants that have altered biological properties. 

In one embodiment, the crystals and structure coordinates obtained therefrom are useful 
for identifying and/or designing compounds that bind LuxS as an approach towards developing 
new therapeutic agents. For example, a high resolution X-ray structure will often show the 
locations of ordered solvent molecules around the LuxS, and in particular at or near putative 
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binding sites on the LuxS. This information can then be used to design molecules that bind 
these sites, the compounds synthesized and tested for binding in biological assays. Travis, 
1993, Science 262:1374. 

In another embodiment, the structure is probed with a plurality of molecules to 
determine their ability to bind to the LuxS at various sites. Such compounds can be used as 
targets or leads in medicinal chemistry efforts to identify, for example, inhibitors of potential 
therapeutic importance. 

In still another embodiment, compounds that can isomerize to short-lived reaction 
intermediates in the chemical reaction of a LuxS-binding compound with LuxS can be 
developed. Thus, the time-dependent analysis of structural changes in LuxS during its 
interaction with other molecules is enabled. The reaction intermediates of LuxS can also be 
deduced from the reaction product in co-complex with LuxS. Such information is useful to 
design improved analogues of known LuxS inhibitors or to design novel classes of inhibitors 
based on the reaction intermediates of LuxS and LuxS-inhibitor co-complexes. This provides a 
novel route for designing LuxS inhibitors with both high specificity and stability. 

In yet another embodiment, the structure can be used to computationally screen small 
molecule data bases for chemical entities or compounds that can bind in whole, or in part, to 
LuxS. In this screening, the quality of fit of such entities or compounds to the binding site may 
be judged either by shape complementarity or by estimated interaction energy. Meng et al , 
1992, J. Comp. Chem. 13:505-524. 

The design of compounds that bind to or inhibit LuxS according to this invention 
generally involves consideration of two factors. First, the compound must be capable of 
physically and structurally associating with LuxS. This association can be covalent or non- 
covalent. For example, covalent interactions may be important for designing irreversible or 
suicide inhibitors of a LuxS. Non-covalent molecular interactions important in the association 
of LuxS with its substrate include hydrogen bonding, ionic interactions and van der Waals and 
hydrophobic interactions. Second, the compound must be able to assume a conformation that 
allows it to associate with LuxS. Although certain portions of the compound will not directly 
participate in this association with LuxS, those portions may still influence the overall 

261295-1 46 



conformation of the molecule. This, in turn, may have a significant impact on potency. Such 
conformational requirements include the overall three-dimensional structure and orientation of 
the chemical group or compound in relation to all or a portion of the binding site, or the 
spacing between functional groups of a compound comprising several chemical groups that 
5 directly interact with LuxS. 

The potential inhibitory or binding effect of a chemical compound on LuxS may be 
analyzed prior to its actual synthesis and testing by the use of computer modeling techniques. 
If the theoretical structure of the given compound suggests insufficient interaction and 
association between it and LuxS, synthesis and testing of the compound is unnecessary. 
10 However, if computer modeling indicates a strong interaction, the molecule may then be 

synthesized and tested for its ability to bind to LuxS and inhibit its activity. In this manner, 
^ synthesis of ineffective compounds may be avoided. 

^3 An inhibitory or other binding compound of LuxS may be computationally evaluated 

f L| and designed by means of a series of steps in which chemical groups or fragments are screened 
f Js and selected for their ability to associate with the individual binding pockets or other areas of 
m LuxS. One skilled in the art may use one of several methods to screen chemical groups or 
3 fragments for their ability to associate with LuxS. This process may begin by visual inspection 
m of, for example, the active site on the computer screen based on the LuxS coordinates. Selected 
'% fragments or chemical groups may then be positioned in a variety of orientations, or docked, 
Qo within an individual binding pocket of LuxS as defined supra. Docking may be accomplished 
using software such as QUANTA and SYBYL, followed by energy minimization and 
molecular dynamics with standard molecular mechanics forcefields, such as CHARMM and 
AMBER. 

Specialized computer programs may also assist in the process of selecting fragments or 
25 chemical groups. These include: 

1. GRID (Goodford, 1985, J. Med. Chem. 28:849-857). GRID is available from 
Oxford University, Oxford, UK; 

2. MCSS (Miranker & Karplus, 1991, LuxSs: Structure, Function and Genetics 
11:29-34). MCSS is available from Molecular Simulations, Burlington, MA; 
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3. AUTODOCK (Goodsell & Olsen, 1990, LuxSs: Structure, Function, and 
Genetics 8:195-202). AUTODOCK is available from Scripps Research Institute, La Jolla, CA; 
and 

4. DOCK (Kuntz et al 9 1982, J. Mol. Biol. 161:269-288). DOCK is available 
from University of California, San Francisco, CA. 

Once suitable chemical groups or fragments have been selected, they can be assembled 
into a single compound or inhibitor. Assembly may proceed by visual inspection of the 
relationship of the fragments to each other in the three-dimensional image displayed on a 
computer screen in relation to the structure coordinates of LuxS. This would be followed by 
manual model building using software such as QUANTA or SYBYL. 

Useful programs to aid one of skill in the art in connecting the individual chemical 
groups or fragments include: 

1. CAVEAT (Bartlett et al , 1989, 'CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules 1 . In Molecular Recognition in 
Chemical and Biological Problems', Special Pub., Royal Chem. Soc. 78:182-196). CAVEAT 
is available from the University of California, Berkeley, CA; 

2. 3D Database systems such as MACCS-3D (MDL Information Systems, San 
Leandro, Calif.). This area is reviewed in Martin, 1992, J. Med. Chem. 35:2145-2154); and 

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 

Instead of proceeding to build a LuxS inhibitor in a step-wise fashion one fragment or 
chemical group at a time, as described above, LuxS binding compounds may be designed as a 
whole or 'de novo' using either an empty active site or optionally including some portion(s) of 
a known inhibitor(s). These methods include: 

1. LUDI (Bohm, 1992, J. Comp. Aid. Molec. Design 6:61-78). LUDI is available 
from Molecular Simulations, Inc., San Diego, CA; 

2. LEGEND (Nishibata & Itai, 1991, Tetrahedron 47:8985). LEGEND is 
available from Molecular Simulations, Burlington, Mass.; and 

3. LeapFrog (available from Tripos, Inc., St. Louis, Mo.). 
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Other molecular modeling techniques may also be employed in accordance with this 
invention. See, e.g., Cohen et al, 1990, J. Med. Chem. 33:883-894. See also, Navia & 
Murcko, 1992, Current Opinions in Structural Biology 2:202-210. 

Once a compound has been designed or selected by the above methods, the efficiency 
5 with which that compound may bind to LuxS may be tested and optimized by computational 
evaluation. For example, a compound that has been designed or selected to function as a 
LuxS-inhibitor must also preferably occupy a volume not overlapping the volume occupied by 
the active site residues when the native substrate is bound. An effective LuxS inhibitor must 
preferably demonstrate a relatively small difference in energy between its bound and free states 
10 {i.e. , it must have a small deformation energy of binding). Thus, the most efficient LuxS 

inhibitors should preferably be designed with a deformation energy of binding of not greater 
m than about 10 kcal/mol, preferably, not greater than 7 kcal/moL LuxS inhibitors may interact 
}3 with the LuxS in more than one conformation that is similar in overall binding energy. In 
f [j those cases, the deformation energy of binding is taken to be the difference between the energy 
f 1 5 of the free compound and the average energy of the conformations observed when the inhibitor 
j!:? binds to the enzyme. 

s A compound selected or designed for binding to LuxS may be further computationally 

ry optimized so that in its bound state it would preferably lack repulsive electrostatic interaction 
% with the target LuxS. Such non-complementary electrostatic interactions include repulsive 
Qo charge-charge, dipole-dipole and charge-dipole interactions. Specifically, the sum of all 

electrostatic interactions between the inhibitor and the LuxS when the inhibitor is bound to it 
preferably make a neutral or favorable contribution to the enthalpy of binding. 

Specific computer software is available in the art to evaluate compound deformation 
energy and electrostatic interaction. Examples of programs designed for such uses include: 
25 Gaussian 92, revision C (Frisch, Gaussian, Inc., Pittsburgh, PA. ©1992); AMBER, version 
4.0 (Kollman, University of California at San Francisco, ©1994); QUANT A/CHARMM 
(Molecular Simulations, Inc., Burlington, MA, ©1994); and Insight II/Discover (Biosym 
Technologies Inc., San Diego, CA, ©1994). These programs may be implemented, for 
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instance, using a computer workstation, as are well-known in the art. Other hardware systems 
and software packages will be known to those skilled in the art. 

Once a LuxS-binding compound has been optimally selected or designed, as described 
above, substitutions may then be made in some of its atoms or chemical groups in order to 
5 improve or modify its binding properties. Generally, initial substitutions are conservative, 
i.e. , the replacement group will have approximately the same size, shape, hydrophobicity and 
charge as the original group. One of skill in the art will understand that substitutions known in 
the art to alter conformation should be avoided. Such altered chemical compounds may then 
be analyzed for efficiency of binding to LuxS by the same computer methods described in 
10 detail above. 

Because LuxS may crystallize in more than one crystal form, the structure coordinates 
r i of LuxS, or portions thereof, are particularly useful to solve the structure of those other crystal 
li forms of LuxS. They may also be used to solve the structure of LuxS mutants, LuxS 
f il co-complexes, or of the crystalline form of any other LuxS with significant amino acid 
ms sequence homology to any functional domain of LuxS. 

m One method that may be employed for this purpose is molecular replacement. In this 

: method, the unknown crystal structure, whether it is another crystal form of LuxS, a LuxS 
ry mutant, or a LuxS co-complex, or the crystal of some other LuxS with significant amino acid 
]*! sequence homology to any functional domain of LuxS, may be determined using phase 
j|o information from the LuxS structure coordinates. This method may provide an accurate three- 
dimensional structure for the unknown LuxS in the new crystal more quickly and efficiently 
than attempting to determine such information ab initio. In addition, in accordance with this 
invention, LuxS mutants may be crystallized in co-complex with known LuxS inhibitors. The 
crystal structures of a series of such complexes may then be solved by molecular replacement 
25 and compared with that of wild-type LuxS. Potential sites for modification within the various 
binding sites of the LuxS may thus be identified. This information provides an additional tool 
for determining the most efficient binding interactions, for example, increased hydrophobic 
interactions, between LuxS and a chemical group or compound. 
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If an unknown crystal form has the same space group as and similar cell dimensions to 
the known LuxS crystal form, then the phases derived from the known crystal form can be 
directly applied to the unknown crystal form, and in turn, an electron density map for the 
unknown crystal form can be calculated. Difference electron density maps can then be used to 
examine the differences between the unknown crystal form and the known crystal form. A 
difference electron density map is a subtraction of one electron density map, e.g. , that derived 
from the known crystal form, from another electron density map, e.g. , that derived from the 
unknown crystal form. Therefore, all similar features of the two electron density maps are 
eliminated in the subtraction and only the differences between the two structures remain. For 
example, if the unknown crystal form is of a LuxS co-complex, then a difference electron 
density map between this map and the map derived from the native, uncomplexed crystal will 
ideally show only the electron density of the ligand. Similarly, if amino acid side chains have 
different conformations in the two crystal forms, then those differences will be highlighted by 
peaks (positive electron density) and valleys (negative electron density) in the difference 
electron density map, making the differences between the two crystal forms easy to detect. 
However, if the space groups and/or cell dimensions of the two crystal forms are different, 
then this approach will not work and molecular replacement must be used in order to derive 
phases for the unknown crystal form. 

All of the complexes referred to above may be studied using well-known X-ray 
diffraction techniques and may be refined versus 50 A to 1.5 A or greater resolution X-ray 
data to an R value of about 0.20 or less using computer software, such as CNS (Yale 
University, (c) 1992, distributed by Molecular Simulations, Inc.). See, e.g., Blundel et al. 9 
1976, Protein Crystallography, Academic Press.; Methods in Enzymologv, vol. 114 & 115, 
Wyckoff et al. , eds., Academic Press, 1985. This information may thus be used to optimize 
known classes of LuxS inhibitors, and more importantly, to design and synthesize novel classes 
of LuxS inhibitors. 

The structure coordinates of LuxS mutants will also facilitate the identification of 
related LuxSs or enzymes analogous to LuxS in function, structure or both, thereby further 
leading to novel therapeutic modes for treating or preventing LuxS mediated diseases. 
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Subsets of the atomic structure coordinates can be used in any of the above methods. 
Particularly useful subsets of the coordinates include, but are not limited to, coordinates of 
single domains, coordinates of residues lining an active site, coordinates of residues that 
participate in important LuxS-LuxS contacts at an interface, and Ca coordinates. For instance, 
the structures of LuxS disclosed herein have been used to identify a LuxS metal binding site, a 
LuxS amino acid binding site, a LuxS active site and a LuxS dimerization interface. The 
coordinates of any such domain of a LuxS may be used to design inhibitors that bind to that 
site, even though the LuxS is fully described by a larger set of atomic coordinates. Therefore, 
a set of atomic coordinates that define the entire polypeptide chain, although useful for many 
applications, do not necessarily need to be used for the methods described herein. 

Furthermore, the atomic coordinates of LuxS can be used to design mutant LuxS 
polypeptides. For instance, residues of the active site, metal binding site, substrate binding 
site or dimerization interface of LuxS can be mutated, conservatively or non-conservatively , to 
generate mutants of LuxS with altered biological function. The mutants can have enhanced, 
reduced or no biological activity associated with that function. In particular, mutant LuxS 
polypeptides can be designed that catalyze a reaction producing an altered form of AI2 that 
might be therapeutically useful. Such an AI2 derivative might inhibit the activity of wtLuxS or 
the activity of other molecules in the quorum sensing pathway. Mutation of the dimerization 
interface residues can be used to create mutant LuxS polypeptides can with enhanced or 
reduced dimerization function to study the biological mechanisms of LuxS. 

7. Example 1: Preparation Of Crystals Of H. pylori LuxS protein 

The subsections below describe the production of a polypeptide containing the H. pylori 
LuxS protein, and the preparation and characterization of diffraction quality crystals, heavy- 
atom derivative crystals. 
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7.1 Production and Purification of LuxS 

7.1.1 Preparation Of LuxS Native Crystals 

LuxS was overexpressed in E. coli cells using normal LB media, the cells then 
collected and sonicated to free soluable LuxS protein, passed over a nickel IMAC column to 
separate it from E. coli proteins and eluted, then passed through a size-exclusion column to be 
exchanged into a 150 mM NaCl, 10 mM HEPES pH 7.5, 10 mM methionine, 1 mM beta- 
mercaptoethanol solvent at a concentration of 3 to 30 mg/ml protein. 

7.1.2 Preparation Of Seleno-methionine Crystals 

LuxS with seleno-methionine substitution was prepared in the same manner as for the 
native crystals except a minimum broth with seleno-methionine as the sole source of 
methionine was employed. 

7.2 Analysis And Characterization Of LuxS Crystals 

7.2.1 Diffraction Data Collection 

The crystals were individually harvested from their trays and transferred to a 
cryoprotectant consisting of 80% well solution, 10% glycerol, and 10% ethylene glycol. After 
about 30 seconds the crystal was collected and frozen in a stream of nitrogen gas that had been 
cooled to 95 degrees Kelvin. The crystals were then transferred in liquid nitrogen to the 
Advanced Photon Source (Argonne National Laboratory) where a two wavelength MAD 
experiment was collected, a peak wavelength and a high energy remote wavelength. 

7.2.2 Structure Determination 

The MAD data was indexed and intergrated using the program Denzo and merged and 
scaled using the program Scalepack. The program SnB was then used to determine the location 
of Selenium-methionine Se's based on the peak wavelength data. These Se sites (10 of 12 Se 
were identified) were refined and phase information for the protein obtained using the program 
SHARP. Solomon solvent flattening of the data was subsequently employed in SHARP. The 
resulting map was viewed in the program O and found to be of excellent quality with 
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essentially all of both of the proteins in the asymmetric unit, main chain and sidechains, easily 
visible. This map was modeled using O to give the position of nearly all of the residues: 
residues 3 through 160 (molecule A) and 3 through 161 (molecule B) of H. pylori LuxS. The 
model was refined using the program CNX. 

7.2.3 Structure Analyses 
The stereochemical quality of the atomic model was monitored using PROCHECK 
(Laskowski et al. , 1993 , "PROCHECK: a program to check the stereochemical quality of 
LuxS structures," J. AppL Cryst. 26:283-291). As defined in PROCHECK, for H. pylori 
LuxS, there are 87.7% (molecule A) and 86.2% (molecule B) of the residues in the model have 
main-chain torsion angles in the most favored Ramachandran regions . No residues fall in the 
disallowed region. In H. pylori, there are only two residues of molecule A and none of 
molecule B that fall in the generously allowed regions. The overall G-factor scores are 0.16 
(H. pylori, molecule A) and 0.13 (H. pylori, molecule B). 

8. Example 2: Preparation Of Crystals Of H. influenzaei LuxS protein 

The subsections below describe the production of a polypeptide containing the H. 
influenzae LuxS protein, and the preparation and characterization of diffraction quality 
crystals, heavy-atom derivative crystals. 

8.1 Production and Purification of LuxS 

8.1.1 Preparation Of LuxS Native Crystals 

LuxS was overexpressed in E. coli cells using normal LB media, the cells then 
collected and sonicated to free soluable LuxS protein, passed over a nickel IMAC column to 
separate it from E. coli proteins and eluted, then passed through a size-exclusion column to be 
exchanged into a 150 mM NaCl, 10 mM HEPES pH 7.5, 10 mM methionine, 1 mM beta- 
mercaptoethanol solvent at a concentration of 3 to 30 mg/ml protein. 
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8.1.2 Preparation Of Seleno-methionine Crystals 
LuxS with seleno-methionine substitution was prepared in the same manner as for the 
native crystals except a minimum broth with seleno-methionine as the sole source of 
methionine was employed. 

8.2 Analysis And Characterization Of LuxS Crystals 

8.2.1 Diffraction Data Collection 

The crystals were individually harvested from their trays and transferred to a 
cryoprotectant consisting of 80% well solution, 10% glycerol, and 10% ethylene glycol. After 
about 30 seconds the crystal was collected and frozen in a stream of nitrogen gas that had been 
cooled to 95 degrees Kelvin. The crystals were then transferred in liquid nitrogen to the 
Advanced Photon Source (Argonne National Laboratory) where a two wavelength MAD 
experiment was collected, a peak wavelength and a high energy remote wavelength. 

8.2.2 Structure Determination 

The MAD data was indexed and intergrated using the program Denzo and merged and 
scaled using the program Scalepack. The program SnB was then used to determine the location 
of Selenium-methionine Se's based on the peak wavelength data. These Se sites (12 of 14 were 
found) were refined and phase information for the protein obtained using the program SHARP. 
Solomon solvent flattening of the data was subsequently employed in SHARP. The resulting 
map was viewed in the program O and found to be of excellent quality with essentially all of 
both of the proteins in the asymmetric unit, main chain and sidechains, easily visible. This map 
was modeled using O to give the position of nearly all of the residues: residues 6 through 164 
(molecule A) and 6 through 166 (molecule B) of H. influenzae LuxS. The model was refined 
using the program CNX. 
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8.2.3 Structure Analyses 
The stereochemical quality of the atomic model was monitored using PROCHECK 
(Laskowski et ah , 1993, "PROCHECK: a program to check the stereochemical quality of 
LuxS structures," J. AppL Cryst. 26:283-291). As defined in PROCHECK, for H. influenzae 
LuxS there are 92.8% (molecule A) and 90.8% (molecule B) of the residues in the model have 
main-chain torsion angles in the most favored Ramachandran regions . There is only one 
residue that falls in the disallowed region (H. Influenzae, molecule B). H. influenzae has one 
residue of each molecule falling in the generously allowed regions. The overall G-factor 
scores are 0.25 (H. influenzae, molecules A and B). 

9. Example 3: Preparation Of P2i Crystals Of D. radioduransi LuxS protein 

The subsections below describe the production of a polypeptide containing the D. 
radiodurans LuxS protein, and the preparation and characterization of diffraction quality 
crystals, heavy-atom derivative crystals. 

9.1 Production and Purification of LuxS 

9.1.1 Preparation Of LuxS Native Crystals 

LuxS was overexpressed in E. coli cells using normal LB media, the cells then 
collected and sonicated to free soluable LuxS protein, passed over a nickel IMAC column to 
separate it from E. coli proteins and eluted, then passed through a size-exclusion column to be 
exchanged into a 150 mM NaCl, 10 mM HEPES pH 7.5, 10 mM methionine, 1 mM beta- 
mercaptoethanol solvent at a concentration of 3 to 30 mg/ml protein. 

9.1.2 Preparation Of Seleno-methionine Crystals 

LuxS with seleno-methionine substitution was prepared in the same manner as for the 
native crystals except a minimum broth with seleno-methionine as the sole source of 
methionine was employed. 
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9.2 Analysis And Characterization Of LuxS Crystals 

9.2.1 Diffraction Data Collection 

The crystals were individually harvested from their trays and transferred to a 
cryoprotectant consisting of 80% well solution, 10% glycerol, and 10% ethylene glycol. After 
about 30 seconds the crystal was collected and frozen in a stream of nitrogen gas that had been 
cooled to 95 degrees Kelvin. The crystals were then transferred in liquid nitrogen to the 
Advanced Photon Source (Argonne National Laboratory) where a two wavelength MAD 
experiment was collected, a peak wavelength and a high energy remote wavelength. 

9.2.2 Structure Determination 

The MAD data was indexed and intergrated using the program Denzo and merged and 
scaled using the program Scalepack. The program SnB was then used to determine the location 
of Selenium-methionine Se's based on the peak wavelength data. These Se sites (11 of 14 were 
seen) were refined and phase information for the protein obtained using the program SHARP. 
Solomon solvent flattening of the data was subsequently employed in SHARP. The resulting 
map was viewed in the program O and found to be of excellent quality with essentially all of 
both of the proteins in the asymmetric unit, main chain and sidechains, easily visible. This map 
was modeled using O to give the position of nearly all of the residues: residues 6 through 162 
(molecule A) and 8 through 162 (molecule B) of D. radiodurans LuxS. The model was refined 
using the program CNX. 

9.2.3 Structure Analyses 

The stereochemical quality of the atomic model was monitored using PROCHECK 
(Laskowski et al , 1993, "PROCHECK: a program to check the stereochemical quality of 
LuxS structures," J. Appl. Cryst. 26:283-291). As defined in PROCHECK, D. radiodurans 
LuxS has 92.2% (molecule A) and 89.7% (molecule B) of the residues in the model have 
main-chain torsion angles in the most favored Ramachandran regions. There were no residues 
that fell in the disallowed region, nor any thai fell in the generously allowed regions. The 
overall G-factor scores are 0.15 for both molecules A and B). 
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10. Example 4: Preparation Of C2 Crystals Of D. radiodurans LuxS protein 

The subsections below describe the production of a polypeptide containing the D. 
radiodurans LuxS protein, and the preparation and characterization of diffraction quality 
5 crystals, heavy -atom derivative crystals. 

10.1 Production and Purification of LuxS 

10.1.1 Preparation Of LuxS Native Crystals 
LuxS was overexpressed in E. coli cells using normal LB media, the cells then 
10 collected and sonicated to free soluable LuxS protein, passed over a nickel IMAC column to 
separate it from E. coli proteins and eluted, then passed through a size-exclusion column to be 
exchanged into a 150 mM NaCl, 10 mM HEPES pH 7.5, 10 mM methionine, 1 mM beta- 
mercaptoethanol solvent at a concentration of 5 to 25 mg/ml protein. 

€ts 10. 1 .2 Preparation Of Seleno-methionine Crystals 

hi LuxS with seleno-methionine substitution was prepared in the same manner as for the 

^ native crystals except a minimum broth with seleno-methionine as the sole source of 
^ methionine was employed. 

So 10.2 Analysis And Characterization Of LuxS Crystals 

B 10.2.1 Diffraction Data Collection 

The crystals were individually harvested from their trays and transferred to a 
cryoprotectant consisting of 80% well solution, 10% glycerol, and 10% ethylene glycol. After 
about 30 seconds the crystal was collected and frozen in a stream of nitrogen gas that had been 
25 cooled to 95 degrees Kelvin. The crystals were then transferred in liquid nitrogen to the 
Advanced Photon Source (Argonne National Laboratory) where a two wavelength MAD 
experiment was collected, a peak wavelength and a high energy remote wavelength. 
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10.2.2 Structure Determination 
This data was not sufficient on its own to solve the structure. However, the LuxS model 
from the P2i structure from D. radiodurans was used along with the molecular replacement 
program EPMR to give a solution using the high energy remote data calculated to averagae 
5 anomolous differences. The resulting map was viewed in the program O and found to be of 
excellent quality with essentially all of both of the proteins in the asymmetric unit, main chain 
and sidechains, easily visible. This map was modeled using O to give the position of nearly all 
of the residues: residues 12 through 162. The model was refined using the program CNX. 

10 10.2.3 Structure Analyses 

The stereochemical quality of the atomic model was monitored using PROCHECK 
(Laskowski et al , 1993, "PROCHECK: a program to check the stereochemical quality of 
LuxS structures/* J. Appl. Cryst. 26:283-291). As defined in PROCHECK, 91.8% of the 
fLS residues in the model of D. radiodurans C2 LuxS have main chain torsion angles in the most 
fis favored Ramachandran regions. No residues fell in the disallowed region, and no residues fell 
ti in the generously allowed regions. The overall G-factor score is 0.23. 
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Table 2 Crystallographic Statistics 
Data Set 



LuxS (H. pylon) 



LuxS (H. influenzae) 



Wavelen Resolution (A) 

L1 (anom) 0.9795 40.0-2.40 

L2(anom) 0.9641 40.0-2.40 

L1 (no anom)0.9795 40.0-2.40 

L1 (anom) 0.9795 40.0-2.10 

L2(anom) 0.9641 40.0-2.10 

L1(noanom)0.9795 40.0-2.10 



LuxS (D. radiodurans) L1 (anom) 0.9795 40.0-1 .80 

P2 1 sp. gp L2(anom) 0.9641 40.0-1.80 

L1 (no anom)0.9795 40.0-1 .80 

LuxS (D. radiodurans) L2(noanom) 0.9641 40.0-2.10 
C2 sp. gp 



Completeness (%) 

Overall/ 

Outer Shell 

100.0/100.0 

100.0/100.0 

100.0/100.0 

99.6/98.6 
99.4/98.1 
99.8/99.5 

86.0/100.0 

85.6/99.2 

86.5/99.5 

98.1/100.0 



Rsym (%) 

Overall/ 

Outer Shell 

5.6/19.8 

5.5/26.0 

8.8/22.5 

6.4/18.4 
5.7/20.3 
6.9/22.0 

7.0/15.3 
5.8/19.6 
7.3/21.5 

5.3/7.4 



MAD F.O.M. 
MR Rfact. 
MRc.c. 

Refinement Statistics 



H. pylori 
H. influenzae 
D. radiodurans P2^ 
D. radiodurans C2 



RMS Deviations 



H. pylori 
H. influenzae 
D. radiodurans P2i 
D. radiodurans C2 



H. pylori 
0.66 



H. influenzae 
0.68 



Rfactor 



Rfree 



0.217 
0.206 
0.188 
0.212 



0.282 
0.236 
0.232 
0.271 



Bond Length (A) Bond Angles (°) 



0.013 
0.010 
0.021 
0.010 



1.8 
1.6 
1.9 
1.6 



Data Collection and MAD Phasing Summary 



D. radiodurans 

0.60 
38.9 
0.54 



Waters 

23 
57 
124 
52 

Thermal 
parameters (A 2 ) 

1.3 
3.0 
2.0 
4.3 



Resolution 

30.0-2.40 
30.0-2.10 
30.0-1.80 
30.0-2.10 



Rs^, = 100 x EhS, | L(h)- < 1(h) > | / EhEiL(h) 

I(+h) and I(-h) processed as independent reflections. Anomalous scattering contributions were included. 
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Figure of merit: |P(<|))exp(i (|))d (() / JP((J))d(<j)), where P is the probability distribution of the phase angle <|). 
R-factor = 100 x Eh | |Fobs(h)| - |Fcaic(h)| i /Sh|Fob S (h)| for reflections with Fobs > 2 a. 
For bonded LuxS atoms. 

Free R-value (Briinger, 1992, "Free R value: a novel statistical quantity for assessing the accuracy of 
5 crystal structures," Nature 355:472-475. ) determined from 10% of the data. 

Table 7, Table 8, Table 9, and Table 10 provide the atomic structure coordinates of 
LuxS. In Tables 7 through 9 coordinates for the two LuxS molecules comprising the 
asymmetric unit are provided, one labeled molecule A and the other molecule B. Table 7 
10 presents the structural coordinates of H. pylori LuxS. Table 8 presents the structural 

coordinates of crystalline H. influenzae LuxS, space group P2i. Table 9 presents the structural 
coordinates of crystalline Z). radiodurans LuxS, space group C2. 

fil The following abbreviations are used in Table 7, Table 8, Table 9, and Table 10: 

Hs " Atom Type " refers to the element whose coordinates are provided. The first letter in 

Ul the column defines the element. 
I" " A. A. " refers to amino acid. 

[J J " X, Y and Z " provide the Cartesian coordinates of the element. 

O "B" is a thermal factor that measures movement of the atom around its atomic center. 

r:Bo " OCC " refers to occupancy, and represents the percentage of time the atom type 

occupies the particular coordinate. OCC values range from 0 to 1, with 1 being 100%. 

" PRT1 " or " PRT2 " relate to occupancy, with PRT1 designating the coordinates of the 
atom when in the first conformation and PRT2 designating the coordinates of the atom when in 
the second or alternate conformation. 
25 Structures coordinates for LuxS according to Table 7, 8, 9, or 10 may be modified by 

mathematical manipulation. Such manipulations include, but are not limited to, 
crystallographic permutations of the raw structure coordinates, fractionalization of the raw 
structure coordinates, integer additions or subtractions to sets of the raw structure coordinates, 
inversion of the raw structure coordinates and any combination of the above. 
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The present invention is not to be limited in scope by the exemplified embodiments, 
which are intended as illustrations of single aspects of the invention. Indeed, various 
modifications of the invention in addition to those described herein will become apparent to 
those having skill in the art from the foregoing description and accompanying drawings. Such 
5 modifications are intended to fall with in the scope of the appended claims. 

All references cited herein are hereby incorporated by reference in their entirety. 
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